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Slicing simple hydrocarbons 


IN BRIEF 
628 News at a glance 


IN DEPTH 


630 NEW CASE OF ALLEGED BULLYING 
ROCKS THE MAX PLANCK SOCIETY 
Colleagues say empathy researcher 
Tania Singer created an atmosphere of 
fear at her lab By K. Kupferschmidt 


631 CRITICS PAN EPA PLAN FOR WEIGHING 
TOXIC CHEMICAL RISKS 

Guidance could allow regulators 

to exclude key studies, researchers 
argue By V. Zainzinger 


632 Al TAKES ON VIDEO GAMES IN QUEST 
FOR COMMON SENSE 

Bots seek to hone strategy skills in 
matchup with top Dota 2 players 

By M. Hutson 


634 STUDY REIGNITES DEBATE ABOUT 
WHEN THERA BLEW ITS TOP 

Radiocarbon curve suggests 
archaeological data were right 

By L. Wade 

SCIENCE ADVANCES RESEARCH ARTICLE BY 

C. L. PEARSON ETAL. 10.1126/SCIADV.AAR8241; 
PODCAST 


SCIENCE sciencemag.org 


635 DETAILED GENOME MAPS PATHS TO 
BETTER WHEAT 
Chromosome-by-chromosome 

sequence helps navigate crop’s genomic 
complexity By E. Pennisi 

> RESEARCH ARTICLES P. 661 & 662; SCIENCE 
ADVANCES RESEARCH ARTICLE BY A. JUHASZ ET 
AL. 10.1126/SCIADV.AAR8602 


FEATURE 
636 TIDE OF LIES 


The researcher at the center of an 

epic scientific fraud remains an enigma 
to the scientists who exposed him 

By K. Kupferschmidt 


INSIGHTS 


PERSPECTIVES 


642 PLANT NUTRIENT ACQUISITION 
ENTICES HERBIVORE 

Maize defense metabolites enable iron 
uptake and attract herbivore attention 
By D. J. Kliebenstein 

> REPORT P. 694 


643 CHOOSING THE RIGHT INPUT IN 
CELL SIGNALING 

A decoding module discriminates 
ligands and signaling activation in the 
Wnt pathway By K. Kim and L. Goentoro 
> RESEARCH ARTICLE P. 663 


Published by AAAS 


645 CAGED CLUSTERS SHINE BRIGHTER 
The structure of small silver clusters 
confined in zeolites helps to explain their 
luminescent properties 

By M. Quintanilla and L. M. Liz-Marzan 

> REPORT P. 686 


646 PHASE SEPARATION FOCUSES 

DNA SENSING 

Liquid droplets form upon recognition 
of DNA to stimulate innate immune 
responses By A. Ablasser 

> REPORT P. 704 


647 PHOTOCATALYTIC UPGRADING 

OF NATURAL GAS 

Cerium salts catalyze formation of value- 
added molecules from methane and 
small alkanes By M. Kanai 

> REPORT P. 668 


648 TURNING UP TRANSLATION IN 

FRAGILE X SYNDROME 

Fragile X mental retardation protein 
promotes translation, contrary to its 
assumed function By S. Aryal and E. Klann 
> REPORT P. 709 


POLICY FORUM 


650 GLOBAL WARMING POLICY: 

IS POPULATION LEFT OUT IN THE COLD? 
Population policies offer options 

to lessen climate risks 

By J. Bongaarts and B. C. O’Neill 


BOOKS ETAL. 
653 AN ISLAND ON THE BRINK 


Convinced erosion, not climate change, 
threatens their home, a community 
grapples with an uncertain future 

By A. J. Miller 


654 OUR LOOMING LEAD PROBLEM 
Neglect, poor planning, and bad 
decisions led to Flint’s water crisis. 
It could easily happen again. 

By F. R. Davis 


LETTERS 


655 HARASSMENT CHARGES: 
ENOUGH HIMPATHY 
By J. Zelikova et al. 


655 HARASSMENT CHARGES: 
JOURNALISTS’ ROLE 
By J. Duffy 
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655 HARASSMENT CHARGES: 
INJUSTICE DONE? 
By A. Moya et al. 


656 HARASSMENT CHARGES: 
METOO BUT DUE PROCESS 
By K. R. Monroe 


RESEARCH 


IN BRIEF 


657 From Science and other journals 


REVIEW 
660 OPTICAL IMAGING 


Quantum-inspired computational 
imaging ¥. Altmann et al. 

REVIEW SUMMARY; FOR FULL TEXT: 
dx.doi.org/10.1126/science.aat2298 


RESEARCH ARTICLES 


WHEAT GENOME 

661 Shifting the limits in wheat 
research and breeding using a 
fully annotated reference genome 
International Wheat Genome 
Sequencing Consortium (IWGSC) 
RESEARCH ARTICLE SUMMARY; FOR FULL 
TEXT: dx.doi.org/10.1126/science.aar7191 


662 The transcriptional landscape of 
polyploid wheat 
R. H. Ramirez-Gonzdlez et al. 
RESEARCH ARTICLE SUMMARY; FOR FULL 
TEXT: dx.doi.org/10.1126/science.aar6089 


> NEWS STORY P. 635; SCIENCE ADVANCES 
RESEARCH ARTICLE BY A. JUHASZ ET AL. 


10.1126/SCIADV.AAR8602 


663 DEVELOPMENTAL SIGNALING 

A molecular mechanism for Wnt ligand- 
specific signaling M. Eubelen et al. 
RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
dx.doi.org/10.1126/science.aat1178 

> PERSPECTIVE P. 643 


REPORTS 
664 ORGANIC CHEMISTRY 


Stereodivergent synthesis of 
1,4-dicarbonyls by traceless 
charge-accelerated sulfonium 
rearrangement D. Kaldre et al. 


668 ORGANIC CHEMISTRY 

Selective functionalization of methane, 
ethane, and higher alkanes by cerium 
photocatalysis A. Hu et al. 

> PERSPECTIVE P. 647 


672 ACTIVE MATTER 

Collective clog control: Optimizing 
traffic flow in confined biological and 
robophysical excavation J. Aguilar et al. 


677 HIGH-PRESSURE PHYSICS 
Insulator-metal transition in 
dense fluid deuterium 

P. M. Celliers et al. 


682 MEMBRANES 
3D printed polyamide membranes 
for desalination M. R. Chowdhury et al. 


686 PHOTOLUMINENSCENCE 
Origin of the bright photo- 
luminescence of few-atom silver 
clusters confined in LTA zeolites 
D. Grandjean et al. 

> PERSPECTIVE P.645 


690 SOLID-STATE PHYSICS 

Twistable electronics with dynamically 
rotatable heterostructures 

R. Ribeiro-Palau et al. 


694 PLANT SCIENCE 

Plant iron acquisition strategy 
exploited by an insect herbivore 
L. Hu et al. 

> PERSPECTIVE P. 642 


697 CLIMATE 

Abrupt cloud clearing of marine 
stratocumulus in the subtropical 
southeast Atlantic S. E. Yuter et al. 


701 RNA METABOLISM 

Mixed tailing by TENT4A and TENT4B 
shields mRNA from rapid deadenylation 
J. Lim et al. 


704 IMMUNOLOGY 

DNA-induced liquid phase condensation 
of cGAS activates innate immune 
signaling M. Du et al. 

> PERSPECTIVE P. 646 


709 TRANSLATIONAL CONTROL 
Fragile X mental retardation 1 gene 
enhances the translation of large 
autism-related proteins 

E. J. Greenblatt and A. C. Spradling 

> PERSPECTIVE P. 648 


DEPARTMENTS 
627 EDITORIAL 


Don’t lose race-conscious policies 
By Liliana M. Garces 


718 WORKING LIFE 


The universe in a classroom 
By Jeffrey Maloy 


ON THE COVER 


=u sums. mest] Spikes of bread wheat 
S 1 ce== (Triticum aestivum), 
a hybrid of three 
f distinct genomes. 


Scientists have recently 
completed sequencing 
of a high-quality refer- 
aan ence wheat genome 
and identified how the 
three ancestral genomes work at a genetic 
level to form a staple of human nutrition. 
See pages 635, 661, and 662. 
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Don't lose race-conscious policies 


sing race as a factor in admission decisions by 
U.S. colleges and universities—so-called affir- 
mative action—has once again become a light- 
ning rod for debate. Last month, several uni- 
versities defended Harvard University against 
a lawsuit that attacks its use of race in student 
admissions. The backdrop of this case, and a 
similar one against the University of North Carolina- 
Chapel Hill, is even more disturbing. Also last month, 
the U.S. Department of Justice (DOJ) and Department 
of Education (DOE), both under leadership appointed 
by the Trump administration, 
encouraged “race-neutral” ad- 
missions practices, rolling back 
previous guidance for achiev- 
ing diversity by these very 
agencies. The U.S. Supreme 
Court has repeatedly allowed 
higher education institutions 
to consider race as one factor, 
among many, in admissions 
decisions. The rollback does 
not change that precedent. 
Two summers ago, the Su- 
preme Court ruled that col- 
leges and universities can use 
race in admissions decisions. 
This Fisher v. University of 
Texas decision added to a 
40-year-old line of Supreme 
Court precedent determining 
that the policy helped post- 
secondary institutions attain 
the educational benefits of 
diversity. In 2011, 2013, and 
2016, the DOJ and DOE is- 
sued guidance documents 
clarifying the implications of this precedent for higher 
education practitioners and administrators. Although 
not legally binding, these documents outlined the legal 
framework and actions that institutions could take to 
achieve diversity and advance their educational mission. 
The 2018 decision to rescind this guidance follows 
action last year by the DOJ to redirect its resources 
toward investigating claims of discrimination at insti- 
tutions that employ race-conscious admissions. These 
actions seek to deny the constitutionally endorsed tool 
that universities use to diversify and drive innova- 
tion—actions that improve public well-being and sci- 
entific inquiry. Is it a coincidence that the DOJ and 
DOE rollback was announced at a time when legal pro- 


“Negative consequences 
also extend 
beyond admissions...” 


ceedings on two affirmative action cases are taking place? 

As a legal scholar and researcher focused on access and 
equity in higher education, I have represented hundreds 
of social science researchers who have defended affirma- 
tive action before the Supreme Court. Those briefs joined 
over 100 other friend-of-the-court briefs filed in support 
of affirmative action in the Fisher case. In the face of a 
potential threat of legal action by the DOJ, it would be 
understandable for institutions to consider abandoning 
race-conscious policies. But doing so will have a dramatic 
and devastating effect on student body diversity and the 
racial climate of college cam- 
puses more generally. As pub- 
lic postsecondary institutions 
in states that ban affirmative 
action (so far, in eight states) 
have already experienced, the 
inability to consider race in 
admissions leads to substan- 
tial drops in the diversity of 
student bodies. These declines 
have taken place at public se- 
lective undergraduate  insti- 
tutions; in graduate fields of 
engineering, natural, and social 
sciences; and in law and medi- 
cal schools. At public medical 
schools across six states with 
bans, for example, the share 
of enrolled students who are 
students of color dropped from 
18.5% to 15.3% after the bans; 
in graduate engineering pro- 
grams, it dropped from 6.2% 
to 4.6%; and in the natural sci- 
ences, it fell from 7.8% to 6.3%. 
The negative consequences 
also extend beyond admissions, making it more difficult 
for administrators to talk about race and racism, and un- 
dermining actions that support the educational experi- 
ences of students already enrolled. 

The health of the nation and the rigor of its scien- 
tific inquiry require postsecondary administrators and 
leaders to respond to recent court actions and other 
legal intimidation by the Trump administration with 
a reinforced commitment to upholding student diver- 
sity. This commitment plays a critical role in preserv- 
ing democracy and bolstering the global economy. 
Sadly, today’s political climate has made efforts to re- 
alize these goals extremely difficult. 

-Liliana M. Garces 
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44 Federal scientists are doing the best they can, 
and they are remarkably resilient. 99 


Jacob Carter of the Union of Concerned Scientists, on its new survey in which scientists 
at 16 agencies describe working conditions under President Donald Trump's administration. 


I N B R I E F Edited by Jeffrey Brainard 


NATIONAL SECURITY 


Psychologists affirm detainee ban 


Military psychologists are barred from working with detained terrorism suspects. 


he American Psychological Association (APA) in Washington, 
D.C., voted last week to retain a policy banning military psycho- 
logists from working with detainees at Guantanamo Bay, the U.S. 
naval base in Cuba, and other national security detention facili- 
ties. But the 115,000-member organization is still at odds about 
the ethical rules of conduct for U.S. psychologists in the gov- 
ernment’s ongoing war against terrorism. APA’s ruling body rejected 
by a nearly two-to-one margin a proposal to let military psychologists 
provide mental health services to detainees who requested help; the 
proposal retained a ban on participating in interrogations. A request 
to create a task force to seek a compromise was also rejected. The de- 
bate, held on the eve of the group’s annual meeting in San Francisco, 
California, took place behind closed doors after some advocates of lift- 
ing the ban said they were being intimidated by opponents. 
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Two Salk lawsuits settled 


WORKPLACE | The Salk Institute for 
Biological Studies in San Diego, California, 
announced last week that it has settled out 
of court with two of three female profes- 
sors who sued it for gender discrimination 
last summer. In recent weeks, “productive 
conversations have led to a resolution of 
all claims between these parties that will 
enable us to put our disagreements behind 
us and move forward together at Salk,” 

the institute’s president, Rusty Gage, wrote 
in a joint statement with the two former 
plaintiffs, Kathy Jones, 63, and Victoria 
Lundblad, 65. Terms of the settlements 
were not disclosed. Both women will 
continue as full professors at the storied 
research center, said their lawyer, Deborah 
Dixon of Gomez Trial Attorneys in San 
Diego. A third plaintiff, Beverly Emerson, 
66, will continue to press her lawsuit, 
which is set to go to trial in December. 
Emerson left Salk after her contract was 
not renewed in December 2017. 


Court rejects EPA on insecticide 


PUBLIC HEALTH | AUS. federal appeals 
court last week ordered the Environmental 
Protection Agency (EPA) to prohibit a 
widely used insecticide linked to health 
problems in children, overruling a decision 
by President Donald Trump’s administration 
to reject the ban. In March 2017, then-EPA 
Administrator Scott Pruitt rejected a peti- 
tion to ban the pesticide, chlorpyrifos. His 
move reversed an EPA decision, made under 
then-President Barack Obama in 2015, to 
institute a ban. Health studies have shown 
that food residues containing chlorpyrifos 
were associated with neurodevelopmental 
disabilities in children. 


Japan med school favors men 


HIGHER EDUCATION | Tokyo Medical 
University (TMU) confirmed last week 
that for at least a decade, administrators 
slashed points from the entrance test scores 
of female applicants to ensure that enter- 
ing students—and, thus, graduates—were 
overwhelmingly male. University officials 
justified the policy by claiming women 
leave their careers after having children, 
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which would cause staffing shortages at 
TMU’ affiliated hospitals—a rationale that 
has infuriated Japanese women, educators, 
and even politicians. For the academic year 
beginning in April, 8.8% of male applicants 
were admitted, but the figure for female 
applicants was only 2.9%, leaving women 
making up only 18% of this year’s incom- 
ing class. The finding also raised suspicions 
about similar practices at other medical 
schools. Women made up only 20.3% of 
Japan’s practicing doctors in 2015, according 
to Organization for Economic Co-operation 
and Development (OECD) data. That ratio 
is the lowest among OECD’s 35 member 
countries, and less than half the average. 
The education ministry has promised to 
investigate admission procedures at all of 
the nation’s 80 medical schools. 


Gene-silencing drug approved 


DRUG DEVELOPMENT | The U.S. Food and 
Drug Administration last week issued its 
first approval for a drug that relies on RNA 
interference (RNAi)—a method of silencing 
disease-causing genes using small pieces 

of RNA. The technique won its discoverers 
a 2006 Nobel Prize, but has faced a rocky 
path during clinical trials because of side 
effects and the challenge of protecting RNA 
fragments from degradation in the body. 
The new drug, sold as Onpattro and devel- 
oped by Cambridge, Massachusetts—-based 
Alnylam Pharmaceuticals, is approved to 
treat nerve degeneration caused by heredi- 
tary transthyretin amyloidosis, a rare disease 
in which a misfolded liver protein builds 

up in the body and damages nerve and heart 
tissue. Clinical trials revealed only mild to 
moderate adverse events in people taking 
Onpattro versus the placebo, including 


swelling of the limbs and reactions at the 
site of infusion. Other RNAi-based treat- 
ments for harder-to-target tissues outside 
the liver are still under development. 


NIH harassment policy questioned 


WORKPLACE | Two Democratic lawmakers 
have criticized the U.S. National Institutes 

of Health (NIH) for what they call an un- 
acceptably lax policy on grant recipients who 
commit sexual harassment. In a 

6 August letter to NIH, Senator Patty Murray 
(D-WA) and Representative Rosa DeLauro 
(D-CT) note that its policy merely requires 
that grantee institutions certify they will 
comply with federal civil rights laws. NIH 
also requires institutions to report principal 
investigators (PIs) they put on administra- 
tive leave for any reason. In contrast, the 
National Science Foundation (NSF) will soon 
adopt a specific harassment policy; it would 
require institutions to report NSF-funded 
PIs found to have committed harassment 
and those put on administrative leave during 
an investigation. “[NIH] has largely failed 

to take steps to hold its awardee institutions 
accountable for fostering safe workplace 
environments regarding extramural 
harassment,” the lawmakers write. They 
requested details on NIH’s policies. 


YouTube clarifies science videos 


SCIENCE COMMUNICATION | As part of 
its growing effort to fight disinformation, 
Google’s YouTube has begun providing 
short blurbs offering science-based infor- 
mation, linked to longer explanations on 
Wikipedia, with its videos about global 
warming and the measles, mumps, and 
rubella vaccine. The annotations appear 
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below algorithmically selected videos that 
peddle climate and vaccine misinformation 
and also on fact-based series; they are part 
of a YouTube campaign targeting topics 
that have drawn conspiracy theorists, such 
as the Apollo moon landing. The blurb 

on climate videos says: “Multiple lines of 
scientific evidence show that the climate 
system is warming,” although it omits the 
significant fact that human greenhouse 

gas emissions have been the primary driver 
of that warming. 


Repairs begin for damaged scopes 


ASTRONOMY | The U.S. National Science 
Foundation (NSF) has provided $16.3 mil- 
lion for repairs now underway to two 
radio telescopes damaged by Hurricane 
Maria last September. Of that total, 

$2 million is for repairing a 240-ton radio 
telescope on St. Croix in the U.S. Virgin 
Islands that sustained wind and water 
damage to its electronics and supporting 
infrastructure; work began last week 

and is expected to be completed in 2 years. 
NSF also gave $14.3 million to repair 

the Arecibo Observatory, a giant radio 
telescope in Puerto Rico. The project 
began in May and is expected to take 
several months. The St. Croix station is the 
southernmost of 10 telescopes in an inter- 
ferometry array stretching across North 
America that sharpens images of cosmic 
objects by combining the telescopes’ 
signals. The array’s observations continue 
with nine stations, but the lack of data 
from St. Croix has degraded some images, 
and several experiments have been halted. 


S SCIENCEMAG.ORG/NEWS 
Read more news from Science online. 


Jury says weed killer caused cancer 


gricultural giant Monsanto must pay $289 million for failing to 

provide proper warnings of the alleged carcinogenic effects 

of glyphosate, the active ingredient in its widely used weed killer 

Roundup, a jury in San Francisco, California, decided last week. 

The civil verdict is the first of its kind; Monsanto faces hundreds 
of additional, similar lawsuits in other U.S. courts from plaintiffs 
who, like the one in the San Francisco case, allege they developed 
non-Hodgkin lymphoma after exposure to the chemical. Controversy 
over glyphosate intensified after the World Health Organization's 
cancer research agency in 2015 judged glyphosate “probably 
carcinogenic to humans.” But after reviewing safety studies, the U.S. 
Environmental Protection Agency and European regulators later 
deemed the product safe if properly used. Last week’s civil lawsuit 
turned on whether Monsanto should have labeled the product's 
potential risks more clearly. Monsanto said it plans to appeal. 
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New case of alleged bullying 
rocks the Max Planck Society 


Colleagues say empathy researcher Tania Singer created an 


atmosphere of fear at her lab 


By Kai Kupferschmidt, in Leipzig, Germany 


n 1 August, Martin Stratmann, presi- 

dent of Germany’s Max Planck Society 

(MPG) in Munich, emailed more than 

300 MPG institute directors about a 

growing concern at Germany’s basic 

research powerhouse. “Recent media 
reports on bullying and harassment at our 
Institutes are harming [MPG’s] reputation,” 
wrote Stratmann, who announced a special 
task force to address the issue. “We want to 
know if the events that have taken place at a 
few institutes are isolated cases or if we are 
dealing with structural problems.” 

At the time, only one such case was widely 
known: Guinevere Kauffmann, director of 
the Max Planck Institute for Astrophysics in 
Garching, stands accused of insulting and 
bullying students and making racist remarks. 
(Kauffmann has received coaching and mon- 
itoring, and now leads a drastically reduced 
group.) But now, another long-festering case 
has exploded into full view. Numerous for- 
mer or current co-workers have accused cel- 
ebrated neuroscientist Tania Singer, director 
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of the Max Planck Institute for Human Cog- 
nitive and Brain Sciences here—and one of 
the world’s foremost experts on empathy—of 
bullying and intimidating them. 

The researchers, all but one of whom 
insisted on remaining anonymous because 
they feared for their careers, describe a 
group gripped by terror of their boss. Many 
say Singer often had especially harsh words 
for women who became pregnant while 
working at the lab. Singer, who is now on a 
1-year sabbatical, declined to answer ques- 
tions for this article, but has acknowledged 
making mistakes in the past: “Problems as- 
sociated to my exhaustion due to having to 
carry and be responsible for [a] huge and 
complex study” were partly to blame, she 
wrote in a 12 February 2017 email to repre- 
sentatives of her department. But in a letter 
to Science, Singer’s lawyer strongly denies 
the bullying allegations. 

Coming shortly after the reports about 
the institute in Garching, published by 
Der Spiegel and BuzzFeed News Germany, 
Singer’s case has raised fresh questions 
about the way MPQ@’s 84 independent insti- 
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Tania Singer wants to show that meditation 
can make people more kind and caring. 


tutes operate. MPG, which has an annual 
budget of €1.8 billion, invests heavily in 
recruiting top researchers from around the 
world, then gives them an unusual amount 
of freedom to run their labs. Some have 
suggested that creates the circumstances in 
which bullying can easily occur. 

PhDnet, a network of Ph.D. students 
within MPG, said it would release a state- 
ment after Science went to press this week 
to propose ways to protect junior MPG sci- 
entists. The group will demand mandatory 
and regular leadership training sessions 
for anyone responsible for training early 
career researchers, for instance, and advi- 
sory committees to oversee Ph.D. theses, 
instead of just one senior scientist. Haakon 
Engen, a former lab member in Singer’s 
group who now works at the University of 
Mainz, says he didn’t have such a commit- 
tee: “That’s a really difficult situation to be 
in as it completely isolates you and puts 
you at the mercy of your [supervisor].” 

An MPG spokesperson said the soci- 
ety could not answer questions about the 
scandals because both Stratmann and Vice 
President Bill Hansson are on holiday this 
week. But in a 7 August statement, MPG ac- 
knowledged it has been trying to address the 
Singer case for more than a year. And in a 
14 July interview with German newspaper 
Frankfurter Allgemeine Zeitung, Stratmann 
said the Garching affair had shown that 
the society’s procedures for dealing with 
complaints did not work well. “I have to 
concede that, and for this reason we will 
improve it,” he said. 

Singer, the daughter of celebrated neuro- 
scientist Wolf Singer, helped found a new 
field called social neuroscience. She rose 
to prominence with her work on empathy, 
including a landmark study published in 
Science (20 February 2004, p. 1157) that 
showed watching a loved one experience 
pain activates the same brain areas as feeling 
physical pain directly. In 2013, she started 
an ambitious study, The ReSource Project, 
in which 160 participants were trained for 
9 months to demonstrate that meditation can 
make people more kind and caring. “She’s 
creative, she can be charming, she knows 
how to make contacts and get resources. It’s 
a gift and it was necessary to make a project 
like this happen,” one colleague says. “Her 
superpower is vision,’ another adds. 

But colleagues say working with Singer 
was always difficult. She wanted to control 
every detail but was often not available for 
discussion, they say, and in-person meetings 
could quickly turn into a nightmare. “She 
gets extremely emotional and when that 
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turns dark it is terrifying,’ one colleague 
says. “Whenever anyone had a meeting with 
her there was at least an even chance they 
would come out in tears.” 

Almost every current or former lab mem- 
ber brought up Singer’s treatment of preg- 
nant women. “People were terrified. They 
were really, really afraid of telling her about 
their pregnancies,” one former colleague 
says. Bethany Kok, a former lab member, 
says Singer reacted kindly when she first 
told her she was pregnant with twins. But 
the next day, Kok says, “She started scream- 
ing at me how she wasn’t running a charity, 
how I was a slacker and that I was going 
to work twice as hard for the time I would 
be gone.” A few weeks later, Kok says, she 
miscarried one of the twins and missed a lab 
meeting for an urgent medical appointment; 
she says Singer reproached her in an email. 

The issue was also on a 


20 scientists, has dwindled to just five.) The 
plan would take effect in January 2019, ac- 
cording to official minutes from the 25 July 
meeting, obtained by Science. 

Singer “has learned the lesson that groups 
that are too big carry the risk of losing con- 
tact with co-workers,” her lawyer writes. “A 
good work environment and dealing with 
each other respectfully are important to her.” 
But Singer’s colleagues say they are speaking 
out now to prevent her from regaining her 
management position. “It appears the Max 
Planck Society decided it would rather sacri- 
fice another generation of students than risk 
a scandal,” says one former colleague. Asked 
how MPG would ensure that future students 
are treated better, a spokesperson says de- 
tails of the plan are still being discussed. 

To investigate and adjudicate future con- 
flicts at MPG labs, the PhDnet paper ad- 

vocates establishing an 


list of grievances, shared independent committee 
with Science, that lab ¢ that would include early 
members say they brought We want to know career scientists and 
up in a meeting with the if the events ... are would be headed by a 
institute’s scientific ad- e e professional mediator, 
visory board in February isolated cases or if not an MPG employee. 
2017 and recorded after- ype are dealing with “Control over the arbi- 
ward. “Pregnancy and pa- 93 tration process, flow of 
rental leave are received structur al pr oblems. information and commu- 


badly and denied/turned 
into accusations,” the notes 
say. Other complaints aired 
at the meeting, according to lab members’ 
notes, included “emotional abuse, threats, 
devaluation of work, and personal abilities.” 

Singer’s lawyer wrote to Science that 
Singer never discriminated against preg- 
nant women or any other group, and said 
the events described by others “either did 
not happen or they happened very differ- 
ently than described.” The letter said Singer 
had “apologized deeply” during a 2017 me- 
diation process and had taken responsibil- 
ity for the problems, for instance by asking 
for the sabbatical and for the appointment 
of a temporary replacement. It also sug- 
gested that the allegations against her came 
from a “subgroup with its own strong inter- 
ests and group dynamics.” 

In its 7 August statement, MPG said 
Hansson had investigated the allegations, 
but that details are confidential. MPG said 
that “to calm the situation down,” it agreed 
to Singer’s sabbatical, which took effect 
in January. In a plan presented to the re- 
searchers on 25 July, MPG said it would 
separate Singer from her current colleagues 
and allow her to set up a new, smaller re- 
search group in Berlin for 2 to 3 years while 
the postdocs and Ph.D. students here fin- 
ish their projects and move on; Singer 
would then return to her lab. (The Leipzig 
group, which once numbered more than 


SCIENCE sciencemag.org 


Martin Stratmann, 
Max Planck Society 


nication has to always lie 
with the victim,” the doc- 
toral students say. “The 
president is open to a dialogue with PhDnet 
on this matter,’ the MPG spokesperson says. 

One problem is that MPG directors of- 
ten are very isolated, says biochemist and 
Nobel laureate Thomas Siidhof of Stanford 
University in Palo Alto, California, who 
held an MPG director’s post in the 1990s. 
“They don’t really have peers around them 
that can criticize them,” he says. “If I treated 
postdocs badly here in Stanford it would 
quickly be known, because I have a hundred 
colleagues whose postdocs talk to my post- 
docs.” But the MPG spokesperson argues 
that problems are rare among the society’s 
309 institute directors. “Two cases are less 
than 1%, you can hardly deduce a struc- 
tural problem from that.” 

Engen, the former member of Singer’s 
lab, says he used to think “there was some- 
thing horribly wrong with the MPG in par- 
ticular” but isn’t so sure today; MPG might 
just attract more media attention than uni- 
versities with similar problems, he wrote 
in an email. The underlying problem is 
that good scientists aren’t necessarily good 
leaders, Engen added. “What is critical is 
that there are safety valves and procedures 
that enable meaningful oversight over the 
people who are in charge, and that the 
people paying the bill (i.e. the politicians) 
engage with the problem.” 
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Critics pan 
EPA plan for 
weighing toxic 
chemical risks 


Guidance could allow 
regulators to exclude key 
studies, researchers argue 


By Vanessa Zainzinger 


cademic scientists and advocacy 

groups are urging the Environmental 

Protection Agency (EPA) to withdraw 

and rewrite proposed guidelines for 

determining which scientific find- 

ings to use when evaluating the 
safety of toxic chemicals. Critics say that, 
if adopted, the guidance will allow regu- 
lators to exclude high-quality health and 
risk studies for “ridiculous” reasons, favor 
industry-backed research, and prevent EPA 
from considering academic studies that rest 
on innovative methods. 

EPA’s guidance “is less about evaluating 
the quality of evidence, and more about elim- 
inating it altogether,’ the Natural Resources 
Defense Council (NRDC) of Washington, 
D.C., wrote this week to EPA in comments 
blasting the “flawed” proposal, which it says 
“describes a head-in-sand approach to any 
evidence that a toxic chemical is toxic.” EPA, 
however, says the guidelines are likely to 
evolve and that it is aiming for an “efficient 
systematic review process that generates 
high-quality, fit-for-purpose risk evaluations 
that rely on the best available science.” 

The controversy, which mirrors a debate 
over a proposal EPA released earlier this 
year that critics say would allow the agency 
to ignore certain human health studies, has 
its roots in a 2016 overhaul of the nation’s 
premier chemical safety law, the Toxic Sub- 
stances Control Act (TSCA). The revised law 
aims to make it easier for EPA to complete 
safety reviews of new chemicals before they 
reach the market and to more quickly re- 
strict the use of existing chemicals if new 
evidence of risks emerges. It also orders 
EPA to develop new guidelines for the “sys- 
tematic review” of the quality of the scien- 
tific evidence used in risk assessments. 

In response, EPA released a 248-page 
draft in May describing the review process 
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it is using for its first major round of chem- 
ical assessments under the new TSCA. The 
agency is focusing on 10 compounds al- 
ready in use, including asbestos (which is 
still used in the production of chlorine), 
carbon tetrachloride (often used to make 
refrigerants), and Violet 29, a common pig- 
ment. EPA notes the document offers only 
“general expectations” and that regulators 
can depart from the guidance. But it does 
appear to allow a publication to be ex- 
cluded if it does not include detailed meth- 
ods, for instance, or if specific information 
about study subjects or data sources is 
withheld because of privacy, business, or 
other concerns. 

A key industry group, the American 
Chemistry Council in Washington, D.C., 
said in a statement that the guidelines have 
“many positive attributes,” but would “ben- 
efit from additional explicit guidance,’ for 
example on how to make EPA’s process for 
integrating evidence into the rulemaking 
process more transparent. 

Critics worry that President Donald 
Trump’s appointees at EPA will use the guid- 
ance—which they note has not been inde- 
pendently reviewed—to put a pro-industry 
spin on evidence reviews, and perhaps 
make it easier for companies to keep dan- 
gerous chemicals on the market. And they 


to make fire-resistant fabrics. 
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U.S. regulators are reviewing the safety of asbestos, which is still used 


say it could make it easier for EPA to ig- 
nore data from certain kinds of health 
studies. “The things that they are exclud- 
ing [studies] on are ridiculous,” says Tracey 
Woodruff, director of the Program on Re- 
productive Health and the Environment at 
the University of California, San Francisco 
(UCSF), one of 25 academics and advocacy 
group officials who signed a letter to EPA 
criticizing the guidance. In particular, they 
say it places too much emphasis on how 
a study is described—or reported—in the 
scientific literature. 

Detailed information on methods and 
subjects is often left out of journal publica- 
tions, the critics note, and has “nothing to 
do with the quality of the study,’ Woodruff 
says. The approach, NRDC adds, mirrors 
the so-called transparency rule proposed 
this past April by former EPA Administra- 
tor Scott Pruitt; researchers argue it would 
bar the agency from using major epidemio- 
logical studies based on confidential health 
records (Science, 4 May, p. 472). 

In contrast, the critics note, the TSCA guid- 
ance fails to mention financial conflicts of in- 
terest as a reason for excluding a study. And 
they argue it inappropriately implies that 
studies done in industry-funded laboratories 
that adhere to specific standards, known as 
Good Laboratory Practices (GLP), are more 
trustworthy than studies done 
in academic laboratories that 
don’t follow GLP—often be- 
cause they are pioneering 
innovative methods (Science, 
10 February 2017, p. 564). 

Critics want EPA to scrap 
the draft guidance and re- 
place it with an established 
method for systematic review, 
such as one developed by the 
National Toxicology Program, 
which has already been peer 
reviewed. “I have not seen an 
explanation of why EPA’s ap- 
proach is so far outside of the 
scientific mainstream on sys- 
tematic reviews,” says Veena 
Singla, an associate director 
of UCSF’s health and environ- 
ment program. 

EPA is now deciding how 
to proceed. The deadline for 
public comment on the draft 
was 16 August. The guidelines 
would need to be revised 
soon, as the agency has said it 
wants to complete its 10 ini- 
tial chemical assessments by 
late next year. & 


Vanessa Zainzinger is a jour- 
nalist in the United Kingdom. 
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ARTIFICIAL INTELLIGENCE 


Al takes on 
video games 
in quest for 
common sense 


Bots seek to hone strategy 
skills in matchup with top 
Dota 2 players 


By Matthew Hutson 


ext week, scientists working on ar- 

tificial intelligence (AI) and games 

will be watching the latest human- 

machine matchup. But instead of a 

single pensive player squaring off 

against a computer, a team of five top 
video game players will be furiously casting 
magic spells and lobbing (virtual) fireballs 
at a team of bots called OpenAI Five. They’ll 
be playing the real-time strategy game Dota 
2 at The International in Vancouver, Can- 
ada, an annual e-sports tournament that 
draws professional gamers who compete for 
millions of dollars. 

In 1997, IBM’s Deep Blue AI bested chess 
champion Garry Kasparov. In 2016, Deep- 
Mind’s AlphaGo AI beat Lee Sedol, a world 
master, at the traditional Chinese board 
game Go. Computers have also defeated 
humans in checkers and some forms of 
poker. But fast-paced multiplayer video 
games pose a different kind of challenge, 
requiring computers to collaborate and 
manage unpredictability. The goal is com- 
mon sense, which could help Als handle 
real-world situations such as navigating 
traffic and providing home care—even if 
they never have to face a magic spell. 

“The next big thing for AI is collabora- 
tion,” says Jun Wang, a computer scientist 
at University College London who works 
on StarCraft IT, another real-time strategy 
game. That requires “strategic reasoning, 
where it’s understanding the incentives of 
others,” says Jakob Foerster, a computer 
scientist at the University of Oxford in the 
United Kingdom, who also works on Star- 
Craft IT. 

Dota 2, released in 2013, has millions of 
players around the world. In a game, teams 
fight to destroy a structure on their ene- 
my’s turf while defending their own, all the 
while collecting resources to increase their 
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strength and skills. A well-matched game 
lasts about 45 minutes. A year ago, OpenAI, 
a research nonprofit based in San Fran- 
cisco, California, revealed an AI that could 
beat the best human players in one-on- 
one games. But the five-on-five matchups 
showcased at The International present a 


much bigger challenge for a 
computer because the games 
are longer and more chaotic, 


So instead of relying on search trees, 
OpenAI Five uses neural networks, algo- 
rithms inspired by the brain that strengthen 
connections between small computing ele- 
ments in response to feedback. (AlphaGo 
combined neural nets with search trees.) 
During training, the system blindly experi- 
ments with different moves 
in the game. When they per- 
form well, the connections 


6 ° 
says Greg Brockman, Open- Tt sucks getting responsible for those acts 
Al’s co-founder and chief embarrassed by are reinforced. After thou- 
technology officer. Still, in sands of years of (sped-up) 
a warmup exhibition last a nonper. SON. gameplay, strong strategies 
ai oe nih easily We were just sera nae pair this 
eat a team of former pro s method, known as reinforce- 
players. “It sucks getting em- getting cr ushed ment learning, on a massive 


barrassed by a nonperson,” 
says William “Blitz” Lee, who 
lost on stage in front of a live 
audience. “We were just get- 
ting crushed left and right.” 
The range of possible moves in Dota 2 is 
far greater than in chess or Go, where each 
move has at most a few hundred options. In 
Dota 2, the action is constant, and players 
have thousands of options per move—where 
to flee, which spell to use, where to aim it. 
Such freedom, combined with the game’s in- 
herent randomness and players’ ignorance 
about what’s out of view, means you can’t 
perfectly predict what the game will look like 
even one move ahead. In chess and Go, algo- 
rithms use search trees, analyzing branching 
possibilities far into the future. In Dota 2, 
forecasts become fuzzy much more quickly. 
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left and right.” 


William “Blitz” Lee, 
Dota 2 player 


scale, running the algorithm 
on thousands of computers 
at once. “OpenAI Five is one 
of the most impressive dem- 
onstrations of reinforcement 
learning I have seen,” says Niels Justesen, 
a computer scientist at the IT University of 
Copenhagen who also works on StarCraft IT. 

Parsing last week’s warmup performance, 
Michael Cook, a computer scientist at Fal- 
mouth University in the United Kingdom 
who studies AI and games, says OpenAI Five 
excels by relying on a “superhuman ability 
to calculate the outcome of certain actions,” 
such as the damage a particular attack 
will inflict on an opponent. OpenAI handi- 
capped it to have the same reaction time as 
a human player, about a fifth of a second, 
but in that blink the system processes much 


Published by AAAS 


In warmup matches, OpenAl Five defeated human 
players at the multiplayer strategy game Dota 2. 


more information. Such thoroughness and 
precision make it deadly at fights, leading 
to “a Blitzkrieg-like approach to the game,” 
Cook says. “It’s amazing to watch.” But the 
aggressiveness may mask a weakness in 
long-term strategizing, Cook suggests: In 
one game that it lost, the AI was assigned 
characters that needed more time to build 
up their abilities, and it couldn’t adjust. 

On the face of it, OpenAI Five also ap- 
pears to succeed at collaboration. The AT’s 
five players were quite willing to be killed 
for the overall good of the team, which 
might confer an advantage over the human 
teams. “The bot plays very sacrificially,’ Lee 
says. Humans are less likely to give up a 
player in order to win, he says. “It’s a very 
human concept to be greedier.” But the AI 
relies on a kind of hive mind that may make 
coordination easier. Each of five nearly 
identical algorithms in the system gets a 
peek at what the others see, whereas hu- 
mans see only what’s on their own screens 
and share information only by talking. To 
collaborate with people or programs un- 
like themselves, whether in games or in life, 
Wang says the algorithms will eventually 
need to develop communication skills and 
“theory of mind”—models of the beliefs and 
desires of other people and algorithms. 

StarCraft IT, the game many AI research- 
ers prefer to work on, may be better for 
honing long-term planning. It is more like 
economics, Wang says, as it entails manag- 
ing a colony’s resources as it builds weapons 
factories. And other games may provide bet- 
ter tests of an Al’s ability to model the mind 
of an opponent than Dota 2; Foerster men- 
tions strategy board games such as Settlers 
of Catan and Risk, where multiple players 
must negotiate, trade, and form alliances, 
both cooperating and competing. 

Even so, Dota 2 remains a worthy test for 
an AI. Many experts are expecting Open- 
AI Five to win at The International. But 
Vanessa Volz, a computer scientist at the 
Technical University of Dortmund in Ger- 
many who studies AI and games, sees a po- 
tential weakness, in that OpenAI Five uses 
self-play to train its algorithms. “This ap- 
proach has the risk of being vulnerable to 
previously unseen playing styles,” she says. 
Lee, who lost to the AI, feels the same way. 
“Right now, the bot is a little too rigid,’ he 
says. “It’s starting to get a little too predict- 
able. I feel like if we’d had a few more games 
we would have been able to take games off 
it pretty cleanly.” 


Matthew Hutson is a journalist in 
New York City. 
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ARCHAEOLOGY 


Study reignites debate about 
when Thera blew its top 


Radiocarbon curve suggests archaeological data were right 


By Lizzie Wade 


undreds of years before the Trojan 

War, the volcanic island of Thera 

in the Aegean Sea blew its top in an 

explosion that rocked the ancient 

world. Sixty times greater than the 

1980 eruption of Mount St. Helens in 
Washington, the blast completely buried the 
Theran town of Akrotiri and sent 12-meter- 
high tsunamis hurtling toward the heart of 
the Minoan civilization on Crete, 110 kilo- 
meters to the south. Some authors have even 
speculated that the Atlantis myth may stem 
from a cultural memory of the cataclysm. 

But when exactly did it happen? The erup- 

tion spread ash across the eastern Mediter- 
ranean, so a precise date could pin down the 
chronologies of ancient cultures including the 
Greeks, Minoans, and Egyptians. Archaeo- 
logists and radiocarbon daters have battled 
fiercely over the timing. By correlating Egyp- 
tian records and pottery, archaeologists put 
the eruption as early as 1500 B.C.E. But radio- 
carbon dates from Akrotiri and nearby sites, 
including an olive tree buried by the erup- 
tion, pointed to a date more than 100 years 
earlier, in the late 17th century B.C.E. 
(Science, 28 April 2006, p. 508). 


“It’s a very impressive data set,” says 
Paula Reimer, a geochronologist at Queen’s 
University Belfast and chair of the Inter- 
national Calibration Working Group (Int- 
Cal), which establishes the worldwide 
radiocarbon calibration curve. If another lab 
can reproduce the findings, she says, every 
radiocarbon date between 1700 B.C.E. and 
1500 B.C.E. might need to be recalibrated. 

The radiocarbon clock doesn’t tick 
steadily. Organisms contain both carbon-14 
(#C) and carbon-12 (°C). At death, “C starts 
to decay at a known rate while ”C levels 
stay the same. By comparing the ratio of 
isotopes, scientists can calculate how long 
ago an organism was alive. But “C in the at- 
mosphere, and therefore in all organisms, 
fluctuates with the amount of cosmic radia- 
tion hitting Earth. To calibrate the clock, 
scientists must track those fluctuations. 
That’s where tree rings come in, because 
they yield both a radiocarbon reading and 
an absolute age. 

The IntCal curve was largely built us- 
ing 10-year chunks of wood. But Pearson’s 
team was able to measure the radiocarbon 
more precisely, in single, annual rings of 


Thera, now called Santorini, erupted 
sometime in the Bronze Age, creating a 
caldera now mostly filled with water. 


three ancient bristlecone pines in California 
and two oak trees in Ireland. The resulting 
calibration curve differs slightly but signifi- 
cantly from the IntCal curve, with a so-called 
radiocarbon plateau that allows a much 
broader range of eruption dates. Some fall in 
the late 16th century B.C.E, perhaps around 
1540 B.C.E.—closer to the archaeological date. 

“This slight adjustment to the shape 
of the calibration curve can allow all the 
lines of evidence to overlap,” Pearson says. 
“Tt doesn’t give us the definite date but it 
moves us a Step in the right direction.” 

For radiocarbon experts, the result is bitter- 
sweet, vindicating archaeologists without 
challenging the radiocarbon measurements 
themselves. Christopher Ramsey of the 
University of Oxford in the United King- 
dom helped date the Thera eruption to the 
17th century B.C.E. and has fielded years of 
critiques of his dates; just last week, for ex- 
ample, a paper in Scientific Reports showed 
that olive trees grow irregularly and might 
give artificially old dates. But if Pearson’s 
paper is correct, Ramsey says, “then all the 
measurements are fine!” It’s the calibration 
data—the gold standard used by research- 
ers around the world—that may be off. 
“That was not something I expected,” he 
says ruefully. 

Sturt Manning, an archaeologist at Cor- 
nell University who led the earlier radio- 
carbon work, worries the new findings 
will encourage traditionalists to disregard 
radiocarbon evidence altogether. “This pa- 
per will now be cited for the next decade to 
demonstrate that there’s ambiguity with the 

radiocarbon, and therefore we should 
ignore it,” he says. He emphasizes that 


A new study promises a truce. A BULGARIA Block Sea the new calibration data must be con- 
team led by archaeologist and tree firmed independently. 
ring scientist Charlotte Pearson of  /TALY Jeremy Rutter, an archaeologist 
The University of Arizona in Tucson GREECE TURKEY emeritus at Dartmouth College, con- 
measured the radiocarbon stored in siders the work “a real step forward.” 
individual rings from five trees. Be- y But, he says, “I’m a little disappointed 
cause tree rings can also be dated ” by where we end up,’ with no abso- 
by simply counting them, the team 0 250 Crete SYRIA lute date for the eruption. 
could correct the radiocarbon dates. kn : “It’s very bad luck that this really 
That broadens the possible dates of Mediterranean Sea important eruption happens to occur 
the eruption to include the tradi- JORDAN on a radiocarbon plateau,” Pearson 


tional archaeological date of the 16th 
century B.C.E. 
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says. “But on the other hand, this is 
science, isn’t it?” 
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AGRICULTURE 


Detailed genome maps paths to better wheat 


Chromosome-by-chromosome sequence helps navigate crop’s genomic complexity 


By Elizabeth Pennisi 


he world’s most widely planted ce- 
real has also proved to be among the 
hardest to improve. Plant breeders 
vastly increased wheat yields during 
the Green Revolution of the 1960s, 
but since then efforts to improve the 
crop through traditional breeding or ge- 
netic technology have been painstakingly 
slow because of the fiendish complexity 
of its genome. Thanks to a decadelong ef- 
fort, the wheat genome has finally come 
into sharp focus, speeding the search for 
genes that could boost harvests and even 
make wheat less likely to trigger allergies. 
The data, described on pp. 661 
and 662, represent the long- 
awaited culmination of the 
International Wheat Genome 
Sequencing Consortium, a mas- 
sive collaboration of academic 
and industry researchers from 
20 countries. Wheat geneticists 
say the newly finished genome, 
which pinpoints 107,000 genes 
on the 21 chromosomes of bread 
wheat, has transformed their re- 
search as they and others gained 
early access to it. “What took us 
years in the past now takes us 
one night,’ says Jorge Dubcovsky 
of the University of California, 
Davis, who recently found a new 
gene for wheat height. “It’s like 
walking with a Google map.” 
Already the new genome has 
helped plant geneticists from 
the John Innes Centre (JIC) in 
Norwich, U.K., to boost grain size by 20% in 
lab-grown wheat. In a preprint posted on the 
bioRxiv server in May, they report identifying 
multiple copies of a gene for grain size origi- 
nally found in rice, then bulking up wheat 
grains by mutating the genes using CRISPR 
gene-editing technology. Many more traits 
beckon. The new sequence “ushers in a new 
era in wheat genetics,” says James Anderson, 
a plant breeder at the University of Minne- 
sota in St. Paul who was not part of the effort. 
For him and thousands of other wheat re- 
searchers, bread wheat’s DNA has been an 
impenetrable thicket. Natural breeding be- 
tween two grasses many thousands of years 
ago gave rise to the durum wheat now used 
in pasta. That hybrid was mated with yet 
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another grass to yield the grain that makes 
everything from bread to beer all over the 
world. But this interspecies mingling pro- 
duced a genome more than five times the 
size of the human one, harboring three 
sets of very similar chromosomes, 21 pairs 
in all, with six copies of most genes. Lack- 
ing a map of this complex landscape, wheat 
breeders had trouble tracking genes from 
generation to generation to see whether a 
crossing had worked, and genetic engineers 
trying to alter a specific DNA sequence of- 
ten did not know where to find it. 

The consortium—born in 2005 as an 
initiative by Kansas farmers—has finally 
mapped the thicket by breaking out and 


To sprout in the spring, winter wheat must be planted the previous autumn— 
an extended growing cycle that some breeders would like to shorten. 


sequencing each chromosome separately. 
In contrast, a genome published last year 
by a group of academics had some longer 
stretches of contiguous sequence but did 
not describe genes or order and orient them 
on the chromosomes. The new genome 
“represents a major step forward,” says JIC’s 
Michael Bevan, whose group also produced 
a rival draft wheat genome last year, before 
working with the consortium. 

JIC researchers including Cristobal Uauy, 
Ricardo Ramirez-Gonzalez, and Philippa 
Borrill have now built on the consortium’s 
data to document gene activity in different 
tissues and at various points of the plant’s 
life cycle. They took 850 snapshots of mes- 
senger RNA levels to gauge which genes are 
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active under conditions including drought, 
pest attack, and other kinds of stress, aim- 
ing to trace networks of genes underlying 
yield and other traits. With these surveys, 
“we can see which of the copies are most 
useful to target,” Borrill says. 

So far, consortium members and others 
have drawn on the genome for more than 
100 published papers, says the consortium’s 
executive director, Kellye Eversole, who is 
based in Bethesda, Maryland. Their finds are 
already being put to use. For example, com- 
monly planted varieties won’t sprout unless 
the seeds have overwintered in the ground. 
Last year, plant geneticist Antje Rohde, now 
at BASF in Ghent, Belgium, reported her 
team had pinned down a key 
gene responsible for the sprout- 
ing delay. By disabling that gene 
using CRISPR, the team hopes to 
shorten the wheat breeding cycle. 

The genome could also help 
bolster wheat’s resistance to dis- 
ease. Drawing on the consortium 
data, Curtis Pozniak and Kirby 
Nilsen at the University of Sas- 
katchewan in Saskatoon, Canada, 
found a gene that makes wheat 
stems stiffer, and hence more re- 
sistant to stem-boring insect pests 
called sawflies. Stiffer wheat has 
more copies of the gene, Nilsen 
found, which points to ways to 
protect other wheat varieties. 

The genome could even aid 
human health, says consortium 
co-leader Rudi Appels, a molecu- 
lar geneticist at Murdoch Univer- 
sity in Perth, Australia. In Science 
Advances this week, he, fellow Murdoch Uni- 
versity researcher Angéla Juhasz, and Odd- 
Arne Olsen from the Norwegian University 
of Life Sciences near Oslo report identifying 
365 genes coding for wheat proteins that 
stimulate an immune or allergic response. 
The data could help breeders aim for less 
problematic wheat—what Appels calls “my 
personal dream.” 

“For the first time, people working in 
wheat have the quality of resources that 
people have in other crops,” says consortium 
co-founder Catherine Feuillet, chief scientific 
officer at Inari Agriculture, a new ag-biotech 
startup in Cambridge, Massachusetts. “We 
now have the tools to do [breeding] in a 
knowledge-based way.’ & 


17 AUGUST 2018 + VOL 361 ISSUE 6403 635 


8LOz ‘91 IsNBny uo /Hio Hewesualds9ouaIds//:d}jjy Wo1 papeojuUMOG 


NEWS 


he first thing that went through 
Alison Avenell’s head when she 
heard Yoshihiro Sato had died 
was that it might be a trick. It was 
March 2017, and in the previous 
years, Avenell, a clinical nutri- 
tionist at the University of Aber- 
deen in the United Kingdom, had 
spent thousands of hours combing 
through Sato’s papers, together with three 
colleagues in New Zealand. They had dis- 
covered that Sato, a bone researcher at a 
hospital in southern Japan, had fabricated 
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data for dozens of clinical trials published 
in international journals. “With so much 
going on, so much fabrication, you just 
wonder if it’s convenient for the person to 
go and hide,” Avenell says. 

Her second thought was that Sato might 
have killed himself. “We have no indica- 
tion that he committed suicide, but it con- 
cerns us,” Avenell said when I met her at 
her office in late 2017. Three years earlier, 
Japanese stem cell scientist Yoshiki Sasai 
had hanged himself in the stairwell of the 
RIKEN Center for Developmental Biology 
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in Kobe after he was caught up in a stem 
cell scandal. “We were aware of the culture 
in Japan and the dishonor something like 
this could bring,” Avenell said. 

It was one more mystery in a deeply 
unsettling case. 

Sato’s fraud was one of the biggest in sci- 
entific history. The impact of his fabricated 
reports—many of them on how to reduce 
the risk of bone fractures—rippled far and 
wide. Meta-analyses that included his tri- 
als came to the wrong conclusion; profes- 
sional societies based medical guidelines 
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TIDE OF LIES 


The researcher at the center of an epic scientific fraud remains 
an enigma to the scientists who exposed him 


on his papers. To follow up on studies they 
did not know were faked, researchers car- 
ried out new trials that enrolled thousands 
of real patients. Exposing Sato’s lies and 
correcting the literature had been a bruis- 
ing struggle for Avenell and her colleagues. 

Yet they could not understand why Sato 
faked so many studies, or how he got away 
with it for so long. They puzzled over the role 
of his co-authors, some of whom had their 
names on dozens of his papers. (“Do we hon- 
estly believe they knew nothing at all about 
what was going on?” Avenell asked.) They 


SCIENCE sciencemag.org 


By Kai Kupferschmidt, in Japan; 


Illustrations by Sara Gironi Carnevale 


wondered whether other doctors at his hos- 
pital read Sato’s work—and whether the Jap- 
anese scientific community ever questioned 
how he managed to publish more than 
200 papers, many of them ambitious stud- 
ies that would have taken most researchers 
years to complete. 

The tools of science that the group had 
used—analyzing studies, calculating sta- 
tistics, writing papers—could reveal fraud. 
But they could not expose the personal and 
cultural factors that drove it, or assess its 
emotional toll. So I set off on a quest that 


Published by AAAS 


would eventually lead me to the Mitate Hos- 
pital in Tagawa, a small town on the island 
of Kyushu, where Sato had worked in the 
last 13 years of his life. 


I: SUSPICION 

Avenell’s own quest began in 2006, when 
she was combing through dozens of pa- 
pers for a review evaluating whether vita- 
min D reduces the risk of bone fractures. 
In two papers by Sato, she stumbled on a 
weird coincidence. They described different 
trials—one in stroke victims, the other in 
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Parkinson’s disease patients—but the con- 
trol and study groups in both studies had 
the exact same mean body mass index. 
Looking further, she quickly found several 
other anomalies. She decided not to include 
Sato’s studies in her analysis. 

She wasn’t the first to notice something 
was off. In a 2005 Neurology paper, Sato 
claimed that a drug named risedronate re- 
duces the risk of hip fractures in women 
who have had a stroke by a stunning 86%. 
In a polite letter to the journal, three re- 
searchers from the University of Cambridge 
in the United Kingdom noted that the study 
was “potentially of great importance,” but 
marveled that the authors had managed to 
recruit 374 patients in just 4 months. 

Two years later, a letter in what was 
then the Archives of Internal Medicine was 
less polite. A study of male stroke patients 
published by Sato had managed to enroll 
280 patients in just 2 months; another 
one, of women with Alzheimer’s disease, 
recruited a staggering 500 in an equally 
short period. Sato claimed to have diag- 
nosed all of the Alzheimer’s patients him- 
self and done follow-up assessments of all 
780 patients every 4 weeks for 18 months. 
Both studies had very few dropouts, and 
both showed risedronate, again, to be a 
resounding success. “We are deeply con- 
cerned whether the data provided by Sato 
et al are valid,’ Jutta Halbekath of Arznei- 
Telegramm, a Berlin-based bulletin about 
the drug industry, and her co-authors 
wrote. Sato apologized in a published re- 
sponse and claimed the study had been 
conducted at three hospitals, not one. “The 
authors did not describe this fact, the rea- 
son being that these hospitals were reluc- 
tant to have their names in the article,” he 
wrote. He didn’t name the other hospitals 
or explain why they wanted to remain 
anonymous. The journal apparently ac- 
cepted the explanation. 

The letter’s authors also spotted a trou- 
bling pattern. In addition to the two papers 
in the Archives of Internal Medicine, they 
found 11 further studies by Sato, published 
elsewhere, that tested whether sunlight, vi- 
tamin D, vitamin K, folate, and other drugs 
could reduce the risk of hip fractures. All but 
two reported “extremely large effects with 
significant results,’ they noted. But the Ar- 
chives of Internal Medicine didn’t want to 
point fingers at other journals. “You may 
allude to your concern that other papers 
have similar concerns,” its editors warned 
Halbekath, “but we cannot allow you to men- 
tion those other papers by journal name.” 

By now, several researchers had raised 
red flags and waved them for everyone to 
see—and then everybody moved on. “The 
trail just went cold,” Avenell says. 
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Il: EVIDENCE 

Mark Bolland had never heard of Sato when 
Avenell first mentioned him in late 2012. She 
and Bolland, a clinical epidemiologist at the 
University of Auckland in New Zealand, have 
never met in person, but they joined forces 
to write meta-analyses on calcium supple- 
ments in 2008, together with Andrew Grey 
and Greg Gamble, both also at the Univer- 
sity of Auckland. One topic the quartet dis- 
cussed frequently was why meta-analyses 
on the same topic sometimes reach different 
conclusions. Avenell mentioned Sato’s stud- 
ies and noted that the effects they reported 
were so strong that they might swing meta- 
analyses if they were included. 

Intrigued, Bolland looked up the papers. 
He, too, was stunned by the large cohorts, 
the low number of dropouts, and the big ef- 
fects of almost any treatment tested. “There 
is nothing that I can think of that produces 
a 70% to 80% reduction in hip fractures, yet 
Sato was able to do it consistently in all his 
trials,” he says. 


“We have no indication 
that he committed suicide, 
but it concerns us.” 


Alison Avenell, University of Aberdeen 


To follow up on his suspicions, Bolland 
turned to statistics. When scientists com- 
pare a treatment and a control group, they 
usually report “baseline characteristics” for 
each—things like age, weight, and sex, or, in 
osteoporosis studies, bone density and cal- 
cium intake. From these values, scientists 
can calculate p-values that are a measure 
of the similarity of two groups for a given 
characteristic; the closer to one the value is, 
the more the groups resemble each other. 
Because the groups are randomly selected, 
the p-values should normally be “equally 
distributed”; the value for age or weight is 
just as likely to be between 0 and 0.1 as be- 
tween 0.9 and 1.0, for example. 

Bolland extracted the baseline charac- 
teristics from the 33 clinical trials Sato 
had published at the time, more than 
500 variables all in all, and calculated their 
p-values. More than half were above 0.8, 
he found. “That just shouldn’t happen,” he 
says. “The randomized groups were incredi- 
bly similar.” There was just one plausible ex- 
planation, he says: Sato had fabricated data 
for both groups and had made them more 
similar than they would ever be in real life. 

The team felt it had a damning indict- 
ment. “I thought: ‘This is so convincing. 


Everybody is going to believe this,” Avenell 
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says. Still, “It needed detailed statistical ref- 
ereeing, and it needed to be published by a 
journal so that other affected journals would 
take note,” she adds. So they wrote their ac- 
cusation as a scientific paper. All they had to 
do was publish it and wait for researchers, 
journals, and institutions to react, investi- 
gate, and retract. Or so they thought. 


III: ACCUSATION 

In March 2013, the team submitted the 
manuscript to The Journal of the Ameri- 
can Medical Association (JAMA), the high- 
est profile journal Sato had published in, 
and one it felt might have the resources 
for an in-depth investigation. After review- 
ing the evidence, JAMA Editor-in-Chief 
Howard Bauchner told the team the edi- 
tors would ask Sato and, if necessary, his 
institution to respond. 

Two years later, in April 2015, JAMA told 
the researchers the hospital had not re- 
sponded, and it would publish an “expres- 
sion of concern’—a short note to flag Sato’s 
JAMA paper as suspicious. It would not 
publish the whistleblowers’ paper, however; 
if the team had concerns about other pa- 
pers, it should contact the journals that had 
published them, Bauchner said. 

The four researchers were shocked. “To 
find out after waiting 2 years that in fact 
nothing much had really happened and, 
other than an expression of concern, was 
going to happen in JAMA, was quite frus- 
trating,’ Bolland says. (Bauchner declined to 
answer Science’s questions about the case.) 

Next, the paper was rejected by JAMA 
Internal Medicine, which had also pub- 
lished Sato’s work. The Journal of Bone 
and Mineral Research, a highly rated jour- 
nal in the osteoporosis field, said it would 
investigate Sato’s papers, but would not 
publish the manuscript either. The editors 
of Trials, which had not published Sato’s 
work, said it would not be appropriate to 
get involved. 

Bolland became demoralized. The other 
three persuaded him not to give up. “If 
you ever embark on something like this, 
make sure you have a good support team,” 
he says now. Avenell, too, was sometimes 
despondent. Whereas the other three re- 
searchers at least saw each other in Auck- 
land, she was on her own, frustrated, in the 
dreary, gray town of Aberdeen. Sometimes, 
she says, she would just sit in a corner of 
her open floor plan office and cry. 

Then, in June 2015, came a small success: 
The Journal of Bone and Mineral Research 
retracted one of the 33 trials the team had 
analyzed. A few other journals followed 
suit in the months after. But some seemed 
irritated by the group’s persistence. “It is ap- 
parent that the responses to the JAMA inves- 
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A far-reaching fraud 

A team of four researchers has worked since 2012 to expose scientific misconduct by Japanese bone 
researcher Yoshihiro Sato, who published more than 200 papers before he died in 2016. The team has 
focused on Sato’s 33 clinical trials, together involving 5894 patients. 
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The 12 trials Sato published in high-impact journals have been widely cited. Many were included in meta- 
analyses, sometimes changing the outcomes, or were translated into treatment guidelines. Other researchers 
used Sato’s fake data as part of the rationale for launching new clinical studies. 
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tigation by Dr. Sato and his institution have 
been either inadequate or not forthcoming,” 
Grey wrote to Bauchner in December 2015. 
“At what point will JAMA consider more de- 
cisive action, such as retraction?” “We will 
consider your opinion about how you think 
it best we should conduct the investigation,” 
Bauchner responded. “We often hear from 
people how they think we should perform 
our responsibilities as editors.” 

In what Bolland calls “really just 
the last throw of the dice,” that same 
month the group submitted the paper to 
Neurology, where Sato had published three 
papers about bone fractures in patients with 
neurological disease. When it was accepted 
8 months later, Avenell cried again. “I’m not 
one usually given to showing such emotion, 
especially when all I have is a computer 
screen and emails to look at,” she says. 

“Journals don’t really like going back to 
investigate when things go wrong,’ Grey 
concludes. “They complain that it’s time- 
consuming and laborious and difficult.” (It 
is all of that, says Avenell, “and no one ever 
thanks you for it.”) The group says investi- 
gations of this scale should not be handled 
by journals or institutions; it has suggested 
a levy on journals to fund an independent 
investigative body. 

By the time Newrology published the in- 
vestigation in December 2016, 10 of the 
33 trials had been retracted, all but one by 
journals the team had contacted. One month 
later, Avenell received an email from an edi- 
tor with troubling news. Sato was dead. 


IV: RIPPLES 

When scientists die, their published pa- 
pers live on—even if they’re based on lies. 
Downloaded in seconds from anywhere in 
the world, fake results continue to steal 
other scientists’ time, influencing their 
choice of which research avenues to follow 
and which trials to design and seek ethical 
approval for. 

Today, 21 of Sato’s 33 trials have been 
retracted by the journals or Sato him- 
self; Avenell has crossed them off a list 
taped next to her computer with a red 
marker. But now the team is following 
the ripples that the studies caused, focus- 
ing, for the time being, on a dozen papers 
published in the journals with the high- 
est impact factors. Together, these stud- 
ies reported results for 3182 participants. 
They have been referenced more than 
1000 times, and 23 systematic reviews or 
meta-analyses have included one or more of 
the 12 trials (see graphic, left). 

One meta-analysis, which found drugs 
called bisphosphonates to be highly effec- 
tive in preventing hip fractures in elderly 
patients with stroke or Parkinson’s, is based 
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entirely on eight trials from Sato, as he was 
the only one to study the issue. A key conclu- 
sion in another meta-analysis rests only on 
Sato’s four studies on bone mineral density 
in Alzheimer’s patients. Two other meta- 
analyses would probably come to different 
conclusions if Sato’s trials were removed, 
Avenell says. One of those, a review show- 
ing that vitamin K helps prevent fractures, 
was the basis of 2011 Japanese guidelines 
that recommend the supplement for people 
at risk. 

The fake trials led to further, real research. 
Eight trials referenced at least one of Sato’s 
fabricated papers in explaining the rationale 
for the trial. Researchers in the Netherlands, 
for instance, launched a huge study in 2008 
to determine whether B vitamins could help 
prevent hip fractures. Two previous studies 
found they didn’t, but Sato had observed “a 
large protective effect” in elderly women. 
“Given the conflicting results and low gen- 
eralizability to the general older population, 
further investigation is needed,” the Dutch 
researchers wrote to explain their think- 
ing. The 2-year study in 2919 elderly people 
found no effect of the vitamins. 

The fraud has also drawn attention to 
the two co-authors whose names appear 
on Sato’s papers most often. One is Kei 
Satoh, president of Hirosaki University, in 
a small town at the northern tip of Japan’s 
main island, Honshu. Sato worked at Hiro- 
saki University, where he collaborated with 
Satoh, until 2003; even after he left for Mitate 
Hospital, 1600 kilometers to the southwest, 
he and Satoh remained frequent co-authors, 
including on 13 of the 33 clinical trials. 

Satoh—whose name, confusingly, is 
sometimes spelled Sato—did not respond 
to Science’s emails. In a short letter to Grey, 
Hirosaki University Vice President Chizuko 
Kohri wrote last November that the uni- 
versity had asked “three outside experts” 
to investigate after the Neurology paper 
was published. The committee investigated 
38 papers, Kohri wrote. Of these, Sato had 
already retracted seven and wanted to re- 
tract another seven. The committee “con- 
cluded that there was research misconduct 
in these 14 papers,’ Kohri wrote, but that 
Sato alone was responsible. According to 
Japanese press reports, Satoh maintains 
that he only corrected the English in the pa- 
pers. As a sign of contrition, he gave up 10% 
of his salary for 3 months. 

Sato’s most important collaborator, 
however, was Jun Iwamoto. A board mem- 
ber of the Osteoporosis Society of Japan, 
Iwamoto was a senior lecturer at Keio Uni- 
versity in Tokyo—one of the country’s most 
prestigious—until 2017, when his contract 
wasn’t renewed in the wake of the Sato 
affair. He and Sato collaborated for more 


640 17 AUGUST 2018 + VOL 361 ISSUE 6403 


E 
& 
5 
5 


than a decade and published more than 
130 papers together, including 25 of the 
33 clinical trials. 

A panel at Keio University has been 
investigating Iwamoto’s clinical _ trials. 
Iwamoto told the panel that he first con- 
tacted Sato in 1998, when Iwamoto was 
working at the New York University Win- 
throp Hospital in Mineola. In 2002 they 
started to put each other’s name on every pa- 
per they authored. Still, Iwamoto claims he 
was unaware of Sato’s practice. “We talked 
to Dr. Iwamoto and in most of the papers 
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which Dr. Sato published, which included 
Dr. Iwamoto’s name, Dr. Iwamoto did not 
know that his name was included,’ says can- 
cer researcher Hideyuki Saya, who heads the 
investigation. The panel was “very shocked” 
by this, Saya says. At the same time, he says, 
“For Dr. Iwamoto it was an honor to put his 
name on Dr. Sato’s [papers] even though he 
did not know much about the content.” 
Although considered highly irregular to- 
day, such “gift authorships” were common 
in the recent past, Saya argues. A 2014 study 
in the International Journal of Japanese 
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Sociology found they are particularly com- 
mon in Japan. “We speculate that most 
natural science researchers in Japan may 
be either confused about or struggle with 
the situation where the strict global criteria 
conflict with specific local cultures that often 
condone gift and ghost authorships,’ the re- 
searchers wrote. 

Saya says the seven trials listing 
Iwamoto as the first author appear not 
to be fabricated. Data for the first four of 
those no longer exist, but Iwamoto can’t 
be faulted for that, says Saya, because un- 
der rules at the time they were conducted, 
he had to save the data for only 5 years. 
Iwamoto did provide data for three more re- 
cent trials. “That data, it seems, was really 
collected,” Saya says. 

But Avenell and her colleagues say they 
have uncovered many problems in trials on 
which Iwamoto was a first author as well. 
For instance, two of them, which tested a 
drug named alendronate, seem to include 
the same group of 25 patients, as indicated 
by their average age, height, serum calcium, 
and numerous other characteristics, but the 
two papers give different recruitment dates 
and inclusion criteria, and some of the out- 
come data differ. Saya chalks problems in the 
papers up to “immaturity.” “We do not think 
there is fabrication,” he says. 

Iwamoto now works at another hospital, 
Saya says. “He has a very nice reputation.” 
When I mention that I would like to talk to 
him, he suggests I should not. “He is very ex- 
hausted,” he says. “Better not to contact him 
at this moment.” Otherwise “the same thing” 
might happen that happened with Sato. 

“What happened with Sato?” I ask. “People 
say he committed suicide over this,” Saya 
says. But he doesn’t know whether that’s true. 


V: ENIGMA 

Sato’s fraudulent work has propelled him 
to No. 6 on Retraction Watch’s list of re- 
searchers who have racked up the most 
retractions. At the top is Japanese anesthesio- 
logist Yoshitaka Fujii, with 183 retractions; 
his frequent co-author Yuhji Saitoh, also 
from Japan, is at 10th place, while Japa- 
nese endocrinologist Shigeaki Kato is No. 8. 
Iwamoto is at No. 9. That means half of the 
top 10 are Japanese researchers. Yet only 
about 5% of published research comes from 
Japan. What explains the number of prolific 
Japanese fraudsters? 

Michiie Sakamoto, who is leading another 
investigation at Keio University, into Iwamo- 
to’s studies in animals, says it has to do with 
respect. “In Japan, we don’t usually doubt 
a professor,’ he says. “We basically believe 
people. We think we don’t need strict rules to 
watch them carefully.” As a result, research- 
ers faking their results may be exposed only 
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after they have racked up many publications. 

Outside researchers may also be less 
likely to question anomalous results from 
Japan. Several early critics of Sato’s work 
say they thought at first that his un- 
usual results might be due to something 
uniquely Japanese. One case in point: In 
2003, Sato published a study on data from 
40 patients with a very rare affliction 
named neuroleptic malignant syndrome, 
collected over 3 years. In a letter to the 
journal, a U.K. neurologist said he and 
his colleagues “could only recall two such 
cases in living memory’—but instead of 
casting doubt on the study, they said it was 
interesting that the syndrome seemed so 
prevalent in Japan. 

But none of that explains why Sato de- 
cided to embark on his fraud—and nobody 
seems to be able to shed much light on that 
question. “Given the number of papers he 
published, he must have spent a very large 
amount of time on them,” Bolland says. 
“I don’t understand what his gain was. ... 


“In Japan, we don’t usually 
doubt a professor. 

We basically believe people. 
We think we don't 
need strict rules to watch 
them carefully.” 


Michiie Sakamoto, Keio University 


There must have been some reason to do it.” 
The Keio University panel is just as puzzled. 
“We discussed this a lot in the committee,” 
Saya says. It might have been like a hobby, 
he suggests. A thrill. Saya uses the word 
“otaku, a Japanese term often applied to 
people who read manga obsessively. 
I thought I might find more clarity at the 
place where Sato perpetrated the fraud. 
Mitate Hospital is not known for its sci- 
entific excellence. Except for one 2006 paper 
on schizophrenia, its entire research output 
over the past 20 years was produced by Sato. 
The hospital is a sprawling complex of 
beige buildings set against green hills. I walk 
up to the reception. It’s quiet, no patients in 
sight. The receptionist does not understand 
me and asks me to write down what I want to 
say. “I am a journalist,’ I write. “I would like 
to talk to the director about Yoshihiro Sato.” 
Her eyes widen as she reads the name 
and she calls a nurse who speaks English. 
The nurse calls the director. “He does not 
want to talk to you,” she says after she hangs 
up. We stand awkwardly next to each at the 
reception desk, both embarrassed. It is clear 
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that everybody wants me to leave. 

As I walk back to the bus stop I look back 
at the hospital. It is an unlikely place for an 
unlikely story. What did people here think 
about the research superstar in their midst? 
What do they remember about the man? 

The effects of Sato’s fraud are still rip- 
pling out: citations, retractions, investiga- 
tions. But the place at the epicenter of the 
disaster reveals nothing. Mitate Hospital 
squats silently in the midday sun. 


Vi: EPILOGUE 

Hours before I leave Japan, I meet Iwa- 
moto’s lawyer, Satoshi Ogawa. We sit in the 
lobby of my Tokyo hotel, our words echoing 
from the bare walls and the marble floor. 
Ogawa says Iwamoto agreed to our conver- 
sation because he wants me to understand 
his point of view. “From his perspective, he 
is a victim.” Avenell’s team, says Ogawa, is 
now giving Iwamoto’s papers a level of scru- 
tiny that is unfair and is causing his client a 
great deal of distress. 

Ogawa says Sato wrote a detailed ac- 
count of his interactions with Iwamoto a 
year before he died. He shows me an Eng- 
lish version of the document, signed by Sato 
and witnessed by Ogawa and a notary. “I 
strongly requested Mr. Iwamoto to include 
my name as an author on the articles for 
which Mr. Iwamoto was the lead author,” 
Sato wrote. “I also started including Mr. 
Iwamoto’s name in the articles for which I 
myself was the lead author.” 

The letter does not mention fraud, how- 
ever. “I couldn’t force him to confess,” 
Ogawa says. “I think he had a mental ill- 
ness.” His emails were not logical, he says. 
“To tell the truth, I predicted that he would 
commit suicide.” 

Suicide. Is he sure that’s what happened? 

“T received the information from the law- 
yer of Mr. Sato,” Ogawa says. Sato also left a 
note, he says, and he paraphrases it: “I am 
very sorry for Mr. Iwamoto. I decided to com- 
mit suicide.” 

When I call Avenell after my return from 
Japan and tell her what I have learned, there 
is stunned silence at first. “That’s what we 
were dreading,” she says. “That’s horrible, re- 
ally horrible” Exposing the misconduct was 
important, she says. “Could we have done it 
without Sato committing suicide? So that he 
felt less guilty? I just don’t know.” 

Later she follows up with an email, still 
astonished at “how such a small piece of 
data analysis a long time ago can end up 
with someone dying.” As a clinician and a re- 
searcher, Avenell wrote, she knows her work 
can eventually make the difference between 
life and death. “But seldom is the connection 
between a clinician and another human be- 
ing’s death so obvious.” 
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PERSPECTIVES 


PLANT PATHOLOGY 


Plant nutrient 
acquisition 
entices 
herbivore 


Maize defense metabolites 
enable iron uptake and 
attract herbivore attention 


By Daniel J. Kliebenstein 


0 survive in highly complex environ- 
ments, plants universally rely on spe- 
cialized, or secondary, metabolites 
to withstand abiotic challenges (for 
example, wax to limit transpiration) 
and biotic challenges (for example, 
glucosinolates to deter herbivores). These 
metabolites are lineage-specific, and func- 
tional studies usually consider them to have 
a singular function. However, the complex- 
ity of the environment is much larger than 
the number of secondary metabolites within 
a plant, indicating that individual special- 
ized metabolites may need to have multiple 
roles. As the number of functions of a single 
metabolite increases, so does the number of 
proteins and processes affected, and there is 
no guarantee that all of these interactions 
are positive. On page 694 of this issue, Hu et 
al. (1) show that benzoxazinoids, a textbook 
example of specialized metabolites in maize, 
have a functional duality centered around 
iron acquisition. The benzoxazinoids aid the 
uptake of iron. However, this generates a 
penalty because the herbivore, western corn 
rootworm, senses benzoxazinoid-iron com- 
plexes as a cue to locate and consume maize 
plants. This presents an evolutionary quan- 
dary, whereby maize can acquire iron and be 
eaten or maize can starve itself of iron and 
avoid herbivory—most plants have metabo- 
lites that create a similar conundrum. 
Benzoxazinoids are critical for how maize 
interacts with a wide array of environmen- 
tal components. They attract beneficial mi- 
crobes to the rhizosphere (the region of soil 
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in contact with plant roots) to help shape 
the maize microbiome (2). They are also 
critical for defense against specialist and 
generalist herbivores through unknown 
mechanisms. However, benzoxazinoids are 
ineffective in defending against the west- 
ern corn rootworm (3, 4) because this her- 
bivore has developed the ability to modify 
and store benzoxazinoids as its own anti- 
parasitic drugs (anthelmintics). This allows 
the western corn rootworm to deter patho- 
genic nematodes (5). This interaction is not 
benzoxazinoid-specific and is a hallmark 
of most plant specialized metabolites, with 
numerous studies showing that individual 
metabolites have beneficial and detrimental 
roles that can span multiple trophic levels 
(6). For example, glucosinolates that are spe- 
cific to the Capparales order provide defense 
against pathogens, insects, and avian herbi- 
vores (7, 8). Simultaneously, aphids can ac- 
cumulate glucosinolates to provide defense 
against predators, which results in a net cost 
to the plant by promoting aphid survival (9). 
Similar opposing interac- 
tions are observed with the 
cardenolides of Asclepias 
(milkweeds), and the gly- 
coalkaloids of the Solana- 
ceae (nightshades) order (6). 
Thus, even though each indi- 
vidual metabolite is lineage- 
specific, the complex system 
of beneficial and detrimen- 
tal functions across trophic 
levels is shared across plant 
lineages. This complexity of 
benefits and costs spread 
across potential abiotic and 
biotic interactions highlights 
the difficulty of adequately 
describing or measuring the fitness conse- 
quence of any specific metabolite. 
Developing approaches to assess the fit- 
ness consequences is essential to under- 
stand the evolutionary pressures associated 
with the natural occurrence as well as the 
synthetic biology application of these natu- 
ral pesticides into new plants. The study of 
Hu et al. shows that the fitness benefit at- 
tributed to a single role of any specialized 
metabolite is inadequate. Instead, a full 
description of the cost and benefit of that 
compound in every interaction in which it 
plays a role is required. This would involve 
cataloging the frequency with which each 
of the different environmental interactions 
may occur. The conditional interaction of 
multiple positive and negative functions 
with a fluctuating environment generates 
a complex fitness landscape. It is possible 
that the environment is sufficiently com- 
plex that there may not be one solution, but 
instead, individuals within a species may 
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Awestern corn rootworm forages 
for maize roots. 


have different metabolic solutions to maxi- 
mize species fitness (JO, 17). 

Additionally, the mechanisms by which 
specialized metabolites function are largely 
unknown. It could be that opposing func- 
tions are the result of the metabolite having 
an independent mechanism in each interac- 
tion. For example, the specialized metabolite 
apigenin can bind a diverse range of proteins 
and variably affect their functions, with each 
interaction having potentially contradictory 
impacts on fitness (72). Alternatively, per- 
haps fitness costs and benefits arise from 
a single related mechanism as its function 
disseminates across the interaction of the 
plant with the environment. Hu et al. show 
that the cost and benefit of benzoxazinoids 
explicitly link to a single iron-binding (sid- 
erophore) function, which simultaneously 
allows iron uptake and invites the attack of 
a major herbivore. The authors propose a 
linear pathway whereby the plant may have 
first utilized these compounds to acquire 
iron and the rootworm then evolved the abil- 
ity to track the compound to 
locate food. 

The findings of Hu et 
al. will lead to exciting fu- 
ture studies necessary to 
assess if the siderophore 
function accounts for the 
majority of benzoxazinoid 
functionality. Alternatively, 
do benzoxazinoids have 
other, unrelated biological 
mechanisms? If a compound 
has multiple independent 
mechanisms, it is unlikely 
that these arose in a lin- 
ear fashion; instead, they 
may represent evolutionary 
serendipitous interactions. Serendipity 
raises complex issues for synthetic biology 
efforts to transfer specialized metabolites 
from one plant to another because unex- 
pected interactions should be anticipated 
even in plants naive to the metabolite (73). 
Hu e¢ al. have taken the first step to under- 
standing how specialized metabolite func- 
tions may have evolved and concurrently 
what engineering them involves. 
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DEVELOPMENT 


Choosing the 
right input in 
cell signaling 


A decoding module 
discriminates ligands 
and signaling activation 
in the Wnt pathway 


By Kibeom Kim and Lea Goentoro 


ver the evolutionary history of mul- 

ticellular animals, several ancient 

signaling pathways have been con- 

served and reused for a myriad of 

functions. One such pathway is the 

Wnt signaling pathway, the almost 
ubiquitous roles of which include vital 
processes such as organogenesis, stem cell 
maintenance, and regeneration (1). This 
diversity of functions of the Wnt pathway 
may explain the diversity of ligands and re- 
ceptors that activate these processes. In hu- 
mans, there are 19 Wnt ligands, 10 Frizzled 
(FZD) receptors, and two lipoprotein recep- 
tor-related protein (LRP) co-receptors, as 
well as other receptors, such as RAR-related 
orphan receptor and receptor-like tyrosine 
kinase (2). Despite this diversity, binding 
interactions between Wnt ligands and FZD 
receptors are highly conserved (1), so that 
multiple Wnt ligands can bind to a FZD, 
and each Wnt ligand can bind to multiple 
FZDs to elicit cellular responses (3). Given 
this promiscuity, it is an ongoing pursuit 
to understand how cells can selectively re- 
spond to a specific Wnt ligand. On page 663 
of this issue, Eubelen et al. (4) pursue this 
question and resolve the mechanism of how 
WNT7 selectivity is achieved. Resolving the 
mechanism for ligand discrimination has 
implications for understanding information 
flow in cell signaling and reveals new thera- 
peutic opportunities. 

Several strategies have been proposed 
for how cells could detect a specific Wnt 
ligand. Some studies showed that dif- 
ferent FZD receptors show intrinsically 
varying binding affinities for different 
Wnt ligands (5). Thus, selectivity could be 
achieved to some extent by expressing the 
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FZDs with high affinity for a specific Wnt 
ligand. Another strategy for selectivity is 
through regulating the spatiotemporal ex- 
pression of Wnt ligands, which is observed 
during development and in adult tissues 
(1). However, it is also typical for various 
Wnt ligands to be expressed in the same 
tissue. Another way to ensure specificity 
is through receptor cofactors. Two cell- 
membrane proteins, reversion-inducing 
cysteine-rich protein with Kazal motifs 
(RECK) and G protein-coupled receptor 
124 (GPR124), selectively enhance WNT7 
signaling in vascular development in ver- 
tebrate brains (6, 7). Although RECK and 
GPR124 were known to physically interact 
with each other and with FZD4 and LRPs 
(6, 7), it was not known how their involve- 
ment confers selective binding to WNT7. 
Eubelen et al. generated a human em- 
bryonic kidney cell line in which all 10 FZD 


naling. Indeed, the authors demonstrated 
that the intracellular domain of GPR124 
binds to Dishevelled, a cytosolic adaptor 
that upon Wnt ligand-receptor binding 
is recruited to FZD. Because Dishevelled 
facilitates Wnt pathway signalosome as- 
sembly through its ability to polymerize 
(8), the authors hypothesized that GPR124 
could function to bring together RECK- 
WNT7 to FZD-LRP. Consistently, they 
showed that GPR124 colocalizes with FZD4 
in a Dishevelled-dependent manner. 

The elucidation of the mechanism by 
which RECK and GPR124 function as “a 
decoding module” of WNT7 by Eubelen et 
al. (see the figure) raises questions on how 
such ligand selectivity at the membrane is 
transduced into the nucleus. WNT7 signal- 
ing in brain angiogenesis is transduced 
downstream through activation of the 
B-catenin pathway (9). But many other 


WNT7 signaling in brain vascular development 

GPR124-RECK functions as a decoding module to specifically recruit WNT7 and present this ligand, ina 
Dishevelled-dependent manner, to the FZD-LRP5/6 receptor complex. WNT7 signalosome assembly leads 
to Dishevelled phosphorylation (P) and activation of B-catenin. 


Decoding module 


WNT ligands 


Signalosome selective for WNT7 


and two LRP genes were inactivated. In 
the absence of FZDs and LRPs, the authors 
found that RECK and GPR124 could not 
transduce WNT7 signaling, but that RECK 
physically binds to WNT7. By itself, RECK 
functions as a competitive inhibitor of sig- 
naling by sequestering WNT7. However, 
the presence of GPR124 converts RECK 
from inhibiting into potentiating WNT7 
signaling. Therefore, RECK functions as a 
specificity factor that recruits WNT7 to the 
cell membrane, and GPR124 determines 
signaling activation. 

How does GPR124 facilitate this WNT7- 
mediated signaling? Structure-function 
analysis in zebrafish brain angiogenesis re- 
vealed that the intracellular domain is nec- 
essary, but not the transmembrane domain. 
This suggests that GPR124 acts as a signal- 
ing-deficient receptor protein in WNT7 sig- 


644 17 AUGUST 2018 + VOL 361 ISSUE 6403 


Dishevelled 
polymerization 
a 


Se B-catenin 


Wnt ligands—for example, WNT3A—can 
also signal through B-catenin. Why would 
a sophisticated mechanism for WNT7 
specificity evolve, only for the signaling 
to converge downstream on f-catenin? It 
would be interesting to investigate how 
the selectivity achieved in Wnt ligand- 
receptor interaction is reflected in B- 
catenin activation. For instance, might the 
WNT7 signalosome produce B-catenin ac- 
tivation with different dynamics than that 
of the WNT3A signalosome? Such ligand- 
specific signaling dynamics has been dem- 
onstrated in the Notch pathway, in which 
delta-like ligand 1 (DLL1) produces pulsa- 
tile downstream signaling, whereas DLL4 
produces sustained signaling, which leads 
to different transcriptional outcomes (10). 

Many Wnt ligands, including WNT’, can 
activate the canonical B-catenin pathway 
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as well as the noncanonical pathways, such 
as the planar cell polarity and Ca” signal- 
ing (11), by engaging with different recep- 
tors (72). Moreover, Dishevelled plays a 
role in both the canonical and noncanoni- 
cal pathways and interacts with multiple 
Wnt receptors (13, 14). Thus, considering 
these seemingly nonspecific uses of WNT7 
and Dishevelled, it will be interesting to 
investigate how the mechanism uncovered 
for WNT7 ligand specificity is transduced 
into activation of select downstream path- 
ways and, in particular, if the decoding 
module plays a role in this. 

Evolving a modular mechanism for li- 
gand specificity, separate from core signal 
transduction, may allow flexibility in fine- 
tuning the two processes independently 
for use in diverse contexts. Within this 
modular architecture, it is intriguing that 
a decoding module could have a dual func- 
tion as a potentiator or inhibitor of signal- 
ing. Future investigations should examine 
whether the GPR124-RECK decoding mod- 
ule is used in other contexts, and whether 
there are different decoding modules for 
different Wnt ligands. Last, malfunctions 
in the Wnt pathway are found in diseases, 
including neurological disorders and can- 
cer (15). Because Wnt signaling is critical 
in multiple organs, targeting core compo- 
nents of the pathway is likely to lead to 
multiple side effects (15). Despite ongoing 
clinical trials, no drugs targeting the Wnt 
pathway have been approved. However, 
having a decoding module that confers 
specificity to Wnt ligand signaling could 
lead to more selective targeted therapy. 
For example, cerebrovascular integrity is 
disrupted in various neurological diseases, 
including glioblastoma and stroke, and un- 
derstanding the mechanism for specifically 
activating WNT7 signaling, which controls 
cerebrovascular development, holds excit- 
ing therapeutic promise. 
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MATERIALS 


Caged clusters shine brighter 


The structure of small silver clusters confined in zeolites 
helps to explain their luminescent properties 


By Marta Quintanilla! and 
Luis M. Liz-Marzan’? 


etal clusters consisting of only a few 

atoms have different physical prop- 

erties than single atoms and larger 

nanoparticles. Like molecules, they 

have discrete energy levels that al- 

low electronic transitions; these 
transitions are the source of their distinct 
properties (7). To maintain their special fea- 
tures, clusters must be protected against 
aggregation and chemical modifications. 
Scientists have developed water-dispersible 
clusters that are stabilized with the help of 
short peptides, proteins, DNA, or polymers 
(2). Clusters may also be stabilized in solid- 
state matrices such as glasses, zeolites, or 
ultracold noble gases. On page 686 of this is- 
sue, Grandjean et al. (3) elucidate the origin 
of silver-cluster luminescence in a sodalite 
zeolite, a simple model that can withstand 
high-energy irradiation and displays bright 
green luminescence. They link the proper- 
ties of the emitted light to different cluster 
structures in the zeolite. 

The use of zeolites as scaffolds to encage 
silver clusters is particularly relevant for 
practical applications, because they can be 
produced reliably in industrial quantities 
at low cost. Zeolites have well-defined mo- 
lecular-scale channels and cavities that are 
perfectly suited to host small clusters. Fur- 
thermore, the zeolite framework contains 
cation-exchange sites that facilitate incor- 
poration of metal ions, which form clusters 
upon heat treatment or x-ray irradiation. 

The luminescence of silver clusters stabi- 
lized in zeolites depends on the topology of 
the zeolite, thermal treatment, presence of 
charge-balancing ions (K*, Na*, or Ca‘), and 
silver loading. Variation of these parameters 
gives access to luminescent emission under 
ultraviolet excitation in a broad range of 
wavelengths, from blue to near-infrared, and 
an external quantum yield (the conversion ef- 
ficiency of incident light to emitted photons) 
that, in some cases, may be as high as 97% (4). 

The high photostability and quantum yield 
of zeolite-encaged silver clusters make them 
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outstanding candidates for light-based ap- 
plications. For example, fluorescent lamps, 
light-emitting diodes (LEDs), and organic 
LEDs use wavelength converters to control 
the color of the output light. In the case of 
commercial LEDs, there is still a need for 
the development of efficient converters that 
produce warm white light. This is a field in 
which zeolite-confined silver clusters appear 
as a highly competitive candidate (5). These 
materials have also been proposed as optical 
humidity sensors and luminescent tags (6). 


Confined clusters 

Schematic representation of a hydrated Ag, cluster 
inside a sodalite zeolite cage. The specific structure 
of this nanocomposite material is responsible for 

a bright photoluminescence, which can be tuned to 
produce a wide variety of colors. 


Sodalite cage 


Hydrated silver 
cluster 


Two main limitations restrict the devel- 
opment of applications for this exciting 
material. First, typical zeolite particle sizes 
(micrometers to tens of micrometers) com- 
promise effective use as optical sensors and 
tags, which require smaller particles that can 
access remote areas of interest, for example, 
in biological imaging. Recently developed 
protocols yield zeolite particles with nano- 
meter dimensions and high colloidal stability 
(7), but these protocols have not yet been ap- 
plied to the silver-clusters technology. 

The second limitation is the practical 
challenge of characterizing the structural 
and electronic properties of the material. 
Intricate host-guest interactions introduce 
interrelated effects that make it difficult to 
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draw general conclusions. Further, the sen- 
sitivity of both clusters and zeolites toward 
highly energetic probing beams limits the 
use of powerful techniques such as x-ray 
diffraction, extended x-ray absorption fine 
structure (EXAFS), and transmission elec- 
tron microscopy (8). Photoluminescence 
experiments, combined with conventional 
structural-characterization techniques such 
as infrared and Raman spectroscopies, can 
partially disentangle the relations between 
structural features and luminescent proper- 
ties. However, the results are, in some cases, 
contradictory, preventing a complete un- 
derstanding of the system that could enable 
prediction, and thus tailoring, of the prop- 
erties of silver clusters in zeolites (6). 

Grandjean et al. now report a systematic 
analysis of silver clusters in a sodalite zeo- 
lite with x-ray excited optical luminescence 
EXAFS and time-resolved spectroscopy. 
These spectroscopic methods make it pos- 
sible to selectively determine the structure 
of the emitting silver species involved in 
photoluminescence. The authors unam- 
biguously identify complex hydrated silver 
clusters [Ag,(H,O),”* (wv = 2 and 4)] at the 
center of sodalite cages (see the figure). 
This is an important finding in a field in 
which structural models have often been 
incomplete and sometimes contradictory. 
The complementarity of the experimental 
techniques chosen by the authors was es- 
sential to achieve this goal. 

Theoretical modeling indicates that the 
remarkable optical properties resulting 
from this structure originate from a con- 
fined superatom quantum system in which 
hybridized silver and water oxygen orbitals 
are highly delocalized over the cluster. Upon 
excitation, one electron of the s-type highest 
occupied molecular orbital is promoted to 
the p-type lowest unoccupied molecular or- 
bitals and relaxes through enhanced system 
intercrossing into long-lived triplet states. 

The authors’ approach may be extended 
to provide detailed understanding of the 
luminescent properties of other materials 
containing few-atom clusters made of silver 
and, perhaps, other metals. Such studies 
may yield a whole class of materials with 
tailored properties, based on luminescent 
clusters, that will certainly find technologi- 
cal applications in the near future. 
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IMMUNOLOGY 


Phase separation focuses DNA sensing 


Liquid droplets form upon recognition of DNA to stimulate innate immune responses 


By Andrea Ablasser 


he presence of any double-stranded 
DNA (dsDNA) in the cytosol is a 
potent trigger for the activation of 
innate immune responses. In the con- 
text of an infection, immunorecogni- 
tion of “foreign” dsDNA serves as a 
central host strategy to detect the presence 
of pathogens. Under certain conditions, 
however, sensing of aberrant cytosolic self- 
dsDNA can also promote destructive inflam- 
mation in autoinflammatory diseases. A 
fundamental question is how cells spatially 
organize and biochemically regulate the rec- 
ognition of intracellular dsDNA in the vast 
cytosolic space of a cell. On page 
704 in this issue, Du and Chen (1) 
introduce an elegant explanation 
by demonstrating that phase sepa- 
ration of the dsDNA sensor cGAS 
[cyclic GMP (guanosine mono- 
phosphate)-AMP (adenosine mo- 
nophosphate) synthase] enables 
this central task. This adds to the 
cellular processes that are facili- 
tated by phase separation. 
When it comes to innate im- 
mune detection, viruses are ex- 
quisitely challenging pathogens. 


Exploiting several host-cell pro- e 


cesses for their own propaga- ~ N terminus 
tion, viruses usually express few C terminus 
pathogen-specific molecular sig- 
natures that could serve as motifs e 

a e@ ———— 
for pattern recognition receptors e<— 


of the innate immune system. A 
central recognition strategy that 
virus-infected cells instead rely on 
is the detection of “mislocalized” 
nucleic acids. The nucleotidyltransferase 
cGAS, identified by the same laboratory in 
2012 (2), is a pivotal innate immune recep- 
tor that detects pathogenic dsDNA. In re- 
sponse to binding dsDNA, cGAS catalyzes 
the synthesis of the second messenger mol- 
ecule cGAMP (cyclic GMP-AMP), which 
triggers a series of cellular events orches- 
trated mainly by activating the endoplasmic 
reticulum-localized protein STING (stimu- 
lator of interferon genes) (3-5). A hallmark 
outcome of cGAS-cGAMP-STING signaling 
is the expression of various cytokines and 
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type I interferons that collectively elicit a 
powerful innate immune response. 
Structural studies have provided impor- 
tant insights into the nature of the cGAS- 
DNA interaction at the molecular level, 
indicating the formation of a tetrameric 
(2:2) complex, which, on long dsDNA mol- 
ecules, can further create ladderlike cGAS 
networks (6-8). Intriguingly, at the cellular 
level, activation of cGAS by dsDNA occurs in 
a length-dependent manner and is accompa- 
nied by the formation of discrete cGAS-con- 
taining cytosolic foci. However, it remained 
unclear how the formation of these aggre- 
gates could be explained in molecular terms 
and what functional properties they may 


possess. Du and Chen now define them as 
membraneless compartments that are gen- 
erated through a process of DNA-mediated 
liquid phase separation of cGAS. 
Liquid-liquid phase separation is a 
physical process whereby components in 
a solution separate into two coexisting 
phases—one dense phase and one dilute 
phase. Biologically regulated phase separa- 
tion has emerged as a key organizing prin- 
ciple underlying the formation of several 
membraneless compartments in living cells, 
including, for example, stress granules, the 
nucleolus, and P bodies in the cytoplasm 
(9-12). These liquid-like structures can con- 
centrate or sequester specific biomolecules 
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Phase separation promotes innate DNA signaling 
Cytosolic dsDNA activates a potent innate immune response through cGAS 
activation. dsDNA nucleates cGAS, which is further concentrated into liquid-like 


droplets to promote the synthesis of CGAMP, which activates STING. 


-— Typel 
interferons 


~__ cytokines 


and thereby potentiate or dampen specific 
biological reactions. Mechanistically, phase 
separation is driven by multivalent inter- 
actions and, of note, can be facilitated by 
polymeric scaffolds such as nucleic acids, to 
which these proteins bind (13). 

Du and Chen found that in the presence 
of dsDNA, cGAS demixes in solution and 
forms liquid-like droplets, which can fuse 
with one another to generate larger spheri- 
cal units in vitro. The cGAS within these 
droplets is highly mobile, and, over time, the 
condensates acquire solid properties remi- 
niscent of colloidal gels. Notably, in living 
cells, DNA-induced cGAS foci bear charac- 
teristics of phase-separated compartments in 
that they display fusion events 
with each other or allow rapid 
exchange of molecules into the 
droplets, including the cGAS 
substrates adenosine triphos- 
phate (ATP) and guanosine 
triphosphate (GTP). 

Could phase separation of 
cGAS regulate its enzymatic 
. activation? The authors show 


Nucleus wisi, that cGAMP synthesis corre- 

Endoplasmic “ene A lates with the degree of phase 

reticulum _ separation in vitro and that, 
‘— Inflammatory 


within cells, active cGAS is 
of contained in droplets. These 
findings point to a model in 
which dsDNA nucleates cGAS, 
which is further concentrated 
into liquid-like droplets to 
promote the synthesis of 
cGAMP, which in turn acti- 
vates STING at the endoplas- 
mic reticulum (see the figure). 
Interestingly, Du and Chen 
note that RNA and single-stranded DNA 
(ssDNA) also demix cGAS in vitro yet fail 
to activate cGAS. Thus, ligand specificity is 
not required to drive phase separation, but 
it defines the enzymatic activation of cGAS. 
Whether sequestering of cGAS via RNA- or 
ssDNA-induced demixing, as might occur 
during virus infection or erroneous accumu- 
lation of self-nucleic acids, negatively affects 
innate DNA sensing and the attendant im- 
mune responses remains an interesting issue 
that needs to be addressed. 

Exploring the conditions under which the 
formation of cGAS droplets occur, Du and 
Chen discover the existence of sharp bound- 
aries that demark a switchlike behavior of 
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phase separation. In addition to cGAS pro- 
tein or dsDNA concentration, the length of 
the dsDNA molecules determines the thresh- 
old of cGAS unmixing. This observed length 
dependency reflects cellular responses to 
dsDNA (7). Interestingly, they find that, by 
providing multiple sites for intermolecular 
interaction with dsDNA, the disordered, pos- 
itively charged N-terminal domain of cGAS 
enforces both the formation of lipid droplets 
as well as the production of cGAMP, empha- 
sizing the importance of multivalence in pro- 
moting phase separation. 

Phase separation is highly sensitive to 
changes in the chemical composition of the 
(cellular) environment. Du and Chen found 
that zinc ions, by stabilizing the DNA-cGAS 
complex, potentiate both droplet forma- 
tion of cGAS as well as cGAMP production. 
More generally, these findings suggest that 
the cellular physicochemical environment 
may serve as a new checkpoint to regulate 
the DNA-sensing capability of cGAS and po- 
tentially other nucleic acid sensors. Under- 
standing how the chemical composition of 
an environment modulates the phase-sepa- 
rating behavior of cGAS will be an interest- 
ing area for future investigation. 

The study of Du and Chen offers a fresh 
perspective on the organization of innate im- 
mune signaling in general. What applies to 
cGAS may also be true for other cytosolic or 
membrane-bound receptors or adaptor mol- 
ecules. Many intriguing questions remain for 
the future. What is the fate of cGAS droplets 
in living cells, and what are the molecular 
determinants that control this process? The 
study of Du and Chen used synthetic dsDNA. 
Thus, it will be interesting to assess the liquid 
phase-based organization of cGAS in differ- 
ent physiological scenarios, such as during 
pathogen infection or under conditions of 
endogenous DNA stress. Perhaps different 
pathogenic or endogenous DNA ligands or 
distinct biological contexts lead to differ- 
ences in cGAS droplets with distinct cellular 
outcomes. Undoubtedly, it is important to un- 
derstand the underlying principles governing 
the dynamic and layered discrimination of 
self- versus nonself-dsDNA by cGAS. 
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Photocatalytic upgrading 


of natural gas 


Cerium salts catalyze formation of value-added 
molecules from methane and small alkanes 


By Motomu Kanai 


he recent shale gas revolution (J) 
continues to increase the estimates 
of the amount of natural gas both 
held in reserve and currently be- 
ing produced. Natural gas contains 
methane (CH,) as the major compo- 
nent, in addition to ethane, propane, and 
butane as minor components. Natural gas 
is normally used as a fuel but could also 
be a cheap starting material for industrial 
chemical synthesis of value-added organic 
molecules (2), if ways are found to activate 
natural gas so that functional groups could 
be added. On page 668 of this issue, Hu 
et al. (3) report an important step toward 
such chemical processes. They upgraded 
natural gas to functionalized organic mol- 
ecules by using a catalytic cerium salt and 
photoirradiation at ambient temperature. 

Functionalization of alkanes must over- 
come the hurdles of low reactivity and poor 
selectivity (4). Cleavage of a C-H bond in 
alkanes for functionalization requires 105 
kcal/mol. This high bond-dissociation energy 
(BDE) is one reason why natural gas is pre- 
served under extreme conditions in Earth’s 
crust on geological time scales. Chemical 
activation of alkanes through C-H bond 
cleavage generally requires harsh conditions, 
such as high temperature, superacid media, 
strong oxidants, or ultraviolet irradiation. 
Because the products and solvents are usu- 
ally more reactive than alkanes, selective 
functionalization of the starting alkanes 
without product and solvent side reactions 
is difficult. Further, taking advantage of the 
scale of the natural-gas resource requires an 
inexpensive catalyst that is reactivated for 
many cycles, that is, one that exhibits very 
high catalyst turnover. 

To overcome these difficulties, Hu et al. 
devised an alkoxy radical (RO’, where R = 
CH,, CCl,CH,, or CF,CH,) generated from a 
mixture of a cerium salt and an alcohol un- 
der blue light-emitting diode (LED) photo- 
irradiation in an acetonitrile (CH,CN) 
solvent, as a catalyst for C-H bond cleav- 
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age of CH, and small alkanes (see the fig- 
ure). Because an O-H bond is as strong as 
a C-H bond, the hydrogen-atom transfer 
process, which cleaves a CH, C-H bond by 
RO’ and forms a methyl radical (‘CH,) and 
an alcohol (RO-H), is thermodynamically 
neutral. This feature drives the reactivity 
of an alkoxy radical (RO*’) in C-H bond 
cleavage of methane. 

Selectivity is dictated by a polarity- 
matching effect (5). An alkoxy radical is 
electrophilic because of the high electron 
affinity of oxygen atoms. Thus, RO’ prefers 
to generate a nucleophilic radical, such as 
a ‘CH,, because electrons smoothly flow 
from the C atom of CH, to the O atom of 
RO’ in the transition state of hydrogen- 


“Compared with previous 
examples of catalytic 
functionalization of CH,... 
the method developed by 
Hu et al. proceeds under 
markedly mild conditions...” 


atom transfer from CH, (polarity match). 
In hydrogen-atom transfer from CH,CN, 
however, both RO’ and °CH,CN are elec- 
trophilic. Thus, a competition between 
the two radicals for an electron on the 
abstracted H atom takes place that makes 
hydrogen-atom transfer from CH,CN by 
RO’ unfavorable (that is, there is polar- 
ity mismatch), despite CH,CN containing 
an intrinsically weaker C-H bond (BDE = 
93 kcal/mol) than in CH,,. In this way, the 
radical reaction avoids the unproductive 
pathway of reaction with the solvent. 

How can this reactive and selective alk- 
oxy radical be generated? The answer by 
Hu et al. relies on photoactivation (6). A 
variety of reaction patterns have been real- 
ized with hybrid catalyst systems compris- 
ing a photosensitizer that absorbs light 
energy and promotes redox reactions be- 
tween catalysts and reagents (7, 8). Hu et 
al. used the photoinduced ligand-to-metal 
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charge transfer of a tetravalent cerium 
alkoxide (ROCe!’) to generate an alkoxy 
radical (9). Irradiation with blue light 
[wavelength (A) = 400 nm] excites the 
cerium alkoxide catalyst, and homolytic 
cleavage of the ROCe”’ bond generates the 
key alkoxy radical (RO’) and a one-electron 
reduced trivalent Ce™ salt. 

The RO’ selectively cleaves a C-H bond 
of CH, through hydrogen-atom transfer, 
generating ‘CH,. Alkyl radicals, including 
‘CH,, are highly reactive and react with a 
NEN, C=C, or C=N bond of the other sub- 
strate; di-tert-butyl azodicarboxylate is 
shown as a representative substrate in the 
figure. The resulting radical intermediate 
is reduced by Ce™, regenerating the Ce” 
alkoxide catalyst and affording functional- 
ized products of CH,. In a representative 
example, CH, reacted with di-tert-butyl 
azodicarboxylate and was converted to 
an N-methylhydrazine derivative with a 
catalyst turnover number of 2900, mean- 
ing that one molecule of a simple cerium 
alkoxide catalyst can generate 2900 value- 
added organic molecules starting from 
CH,. The reaction can also be performed 
in a continuous-flow system, enabling easy 
handling of gas-phase alkane substrates, 
enhanced efficiency in light utilization, 
and straightforward scale-up (10). 

Compared with previous examples of 
catalytic functionalization of CH, (11) and 
other alkanes, the method developed by Hu 


et al. proceeds under markedly mild condi- 
tions (room temperature and near neutral 
pH). Thus, this method may be suitable for 
application to the synthesis of a variety of 
complex molecules, including drugs. Both 
functional group tolerance and control of 
stereoselectivity are essential for develop- 
ment in this direction, but these points 
remain unaddressed at this point. Addi- 
tionally, the catalyst turnover efficiency 
and quantum yield (currently, ~0.2%) 
must be improved, considering the large 
scale needed for the processing of natural 
gas. Despite these potential hurdles, the 
achievement by Hu et al. is a step forward 
in using carbon feedstock natural gas as a 
starting material in the creation of organic 
molecules that benefit our society. 
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Photocatalytic upgrade of methane 
The major component of natural gas, methane, is abundant but chemically inert. A photocatalytic method 
(shown) can activate and functionalize methane as well as other small alkanes, under mild conditions. 
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Turning up 

translation 
in fragile 

X syndrome 


Fragile X mental retardation 
protein promotes 
translation, contrary to 

its assumed function 


By Sameer Aryal’? and Eric Klann?* 


ragile X syndrome (FXS) is the most 
prevalent inherited form of intellec- 
tual disability and autism (J). FXS 
is usually caused by transcriptional 
silencing of the fragile X mental re- 
tardation 1 (FMRI) gene, which en- 
codes fragile X mental retardation protein 
(FMRP), an RNA-binding protein that is 
thought to repress the translation of spe- 
cific messenger RNAs (mRNAs). Precise 
translational control is especially critical 
in neurons because rapid synthesis of pro- 
teins from preexisting mRNAs underlies 
many forms of synaptic plasticity, which is 
altered in animal models of FXS (2). Prog- 
ress has been made in identifying RNAs 
that FMRP binds, but its functional targets 
and modes of translational control remain 
elusive, especially during development 
(3-7). On page 709 of this issue, Greenblatt 
and Spradling (8) use Drosophila melano- 
gaster oocytes to demonstrate that FMRP 
increases the translation of multiple long 
mRNAs, many of which are implicated in 
intellectual disability and autism. 

Several lines of evidence have suggested 
that FMRP inhibits the translation of cer- 
tain mRNAs. Increased net de novo protein 
synthesis is observed in multiple brain 
regions of mouse models of FXS and in 
cells derived from FXS patients (7). More- 
over, the expression of several proteins 
encoded by FMRP-binding mRNAs are 
overexpressed in the brains of mice that 
lack FMRP (2). However, some evidence 
also indicates that the loss of FMRP is as- 
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sociated with decreased syn- 
thesis and/or expression of 
certain proteins encoded by 
FMRP-binding mRNAs (2, 5, 
9), suggesting that it can also 
function to enhance mRNA 
translation. 

Building on investigations 
to identify mRNAs that bind 
to FMRP (3), genome-wide 
studies to identify translat- 
ing mRNAs from hippocam- 
pal CAl pyramidal neurons 
from FXS mice indicate that 
the translation of a substan- 
tial fraction of mistranslated 
mRNAs is down-regulated 
(6, 7). Surprisingly, there is 
limited overlap between the 
mistranslated mRNAs in neu- 
rons from FXS mice and those 
identified as FMRP-binding 
mRNAs. There are several 
possible explanations, which 
include the following: FMRP regulates an- 
other aspect(s) of metabolism of its mRNA 
targets; the mistranslating mRNAs are com- 
pensatory responses to the lack of FMRP 
(6); the loss of repression of FMRP target 
mRNAs causes secondary effects on trans- 
lational control pathways, which then alter 
translation (1); and FMRP regulates trans- 
lation in a cell type- and/or development- 
specific manner. 

Because mature D. melanogaster oocytes 
lack transcription, they must maintain the 
translation of preexisting mRNAs at a dis- 
tance from the nucleus, similar to neurons. 
Additionally, female flies can be manipu- 
lated to store mature oocytes rather than 
ovulate (10). This permits the investigation 
of functional FMRP targets in cells that rely 
exclusively on translational control for ex- 
tended periods of time. The authors found 
that germ cell-specific depletion of Fmri 
mRNA impaired the developmental capac- 
ity of stored oocytes in a time-dependent 
manner: Fmri-deficient oocytes stored for 
a single day hatched normally. However, 
only 20 to 30% of oocytes hatched after 10 
days of storage, and the resulting embryos 
showed defects in neural development. This 
suggests that FMRP plays a crucial role in 
maintaining the developmental compe- 
tence of oocytes when they are dependent 
on the translation of preexisting mRNAs. 

Greenblatt and Spradling carried out ri- 
bosome profiling on oocytes stored for 1 to 
2 days to pinpoint the positions of translat- 
ing ribosomes on individual mRNAs (11). 
They identified more than 400 genes that 
were differentially translated when F'mr1 
was depleted. Surprisingly, more than 
95% of the mistranslated genes showed 


FMRP 
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Fmr1 expressed 


FMRP enhances translation 
FMRP maintains the developmental capacity of D. melanogaster oocytes by promoting 
the translation of specific, mostly long, transcripts. The functional targets of FMRP, 

identified by reduction in ribosome loading with Fmr1 depletion, are enriched in genes 
encoding E3 ubiquitin ligases, including POE. 


decreased ribosome occupancy without al- 
terations in mRNA abundance, suggesting 
that FMRP functions to enhance, rather 
than repress, the translation of many genes 
in D. melanogaster oocytes. More than 10% 
of the mistranslated genes were orthologs 
of human genes that have been implicated 
in neuropsychiatric disorders, including in- 
tellectual disability and autism. This find- 
ing resonates with the long-held belief that 
FMRP regulates the translation of multiple 
autism-linked genes (3). Deeper examina- 
tion of the functions of these genes may 
aid in understanding the molecular basis 
for the comorbidity and variation in sever- 
ity of FXS and autism. 

The authors noted another remarkable 
feature of the mistranslating transcripts: 
They were longer than average. Nearly 
half of all mRNAs coding for long proteins 
(>1800 amino acids) had reduced ribosome 
loading when F'mr!I was depleted. Notably, 
dysregulation of long genes has been ob- 
served in multiple neuropsychiatric disor- 
ders associated with synaptic dysfunction, 
including FXS (72) and autism (13). Thus, 
it is critical to examine why there is a con- 
vergence on the inability to appropriately 
express long genes in such disorders. 

How are these findings reconciled with 
the previously assumed function of FMRP 
as a repressor of translation? Because 
FMRP expression in both mouse and 
D. melanogaster peaks during early devel- 
opment, it is possible that the enhancement 
of translation by FMRP is development 
specific. The authors speculate that the 
decreased synthesis of several E3 ubiquitin 
ligases, which target proteins for degrada- 
tion, after FMRP reduction increases the 


Published by AAAS 


Fmrl1 depleted 


life span of translation factors 
that mediate increased over- 
all de novo protein synthesis 
(see the figure). Indeed, the 
authors demonstrate that oo- 
cytes depleted of the E3 ubiq- 
uitin ligase protein purity of 
essence (POE), a translational 
target of FMRP, mimic those 
lacking Fmri. This indicates 
that E3 ubiquitin ligases may 
be therapeutic targets for the 


mRNAs encoding E3 i 
ubiquitin ee Ribosomes treatment of FXS. 

Given the overlap between 
mistranslating transcripts in 
oocytes and FMRP-binding 

Y i] mRNAs identified in mouse 

E3 ubiquitin brains, it is possible that 

O GQ O CO; ligases O Cp FMRP regulates the transla- 
y Y tion of some mRNAs by direct 

binding. However, further ex- 

@ O e) Target proteins & e% Oo periments are needed to better 


understand how FMRP estab- 

lishes RNA-binding specific- 
ity. Additionally, caution must be exercised 
when interpreting results that provide es- 
timates of protein synthesis via ribosome 
profiling. Because FMRP is thought to im- 
pede ribosome translocation, its loss may 
cause ribosomes to transit at increased rates 
on specific mRNAs, resulting in decreased 
steady-state ribosome loading but increased 
de novo translation of those mRNAs. Al- 
though the authors rule this out for some 
mRNAs by examining their protein abun- 
dance, it is essential that whole-genome 
approaches be used to examine how FMRP 
modulates ribosome transit on individual 
mRNAs. Greenblatt and Spradling’s findings 
add an intriguing new piece to the puzzle of 
identifying the functional targets of FMRP. 
In the future, it may be possible to identify a 
core set of mistranslated mRNAs that medi- 
ate pathophysiology in FXS and other forms 
of autism and intellectual disability. 


REFERENCES AND NOTES 


1. J.C. Darnell, E. Klann, Nat. Neurosci. 16,1530 (2013). 
2. M.R.Santoro etal.,Ann. Rev. Pathol. Mech. Dis. 7,219 
(2012). 
3. J.C.Darnelletal., Cell 146, 247 (2011). 
4. M.Ascanoetal., Nature 492, 382 (2012). 
R. Tabet et al., Proc. Natl. Acad. Sci. U.S.A. 113, £3619 
(2016). 
S.R. Thomson et al., Neuron 95,550 (2017). 
L. Ceolin et al., Front. Mol. Neurosci.10, 340 (2017). 
E. J. Greenblatt, A.C. Spradling, Science 361, 709 (2018). 
9. C.Grossetal.,J. Neurosci. 31,5693 (2011). 
C.A. Jeffreys et al., Curr. Biol. 13, 498 (2003). 
ll. N.T.Ingoliaetal., Science 324, 218 (2009). 
.L. Ouwenga, J. Dougherty, Mol. Autism 6, 16 (2015). 
13. M.J.Zylkaetal., Neuron 86, 353 (2015). 


pS 
es) 


ACKNOWLEDGMENTS 


This work was supported by National Institutes of Health grants 
NS034007, NS047384, and HD082013. 


10.1126/science.aau6450 


17 AUGUST 2018 * VOL 361 ISSUE 6403 649 


8LOz ‘91 IsNBny uo /fio Hewadualdsa0ua!0S//:d}}y Wo PapeojuMOG 


Global warming policy: Is 
population left out in the cold? 


Population policies offer options to lessen climate risks 


By John Bongaarts! and Brian C. O’Neill?* 


ould slowing human population 
growth lessen future impacts of 
anthropogenic climate change? 
With an additional 4 billion peo- 
ple expected on the planet by 
2100, the answer seems an ob- 
vious “yes.” Indeed, substantial scientific 
literature backs up this intuition. Many 
nongovernmental organizations undertake 
climate- and population-related activities, 
and national adaptation plans for most of 
the least-developed countries recognize 
population growth as an important com- 
ponent of vulnerability to climate impacts 
(1). But despite this evidence, much of the 
climate community, notably the Intergov- 
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ernmental Panel on Climate Change (IPCC), 
the primary source of scientific information 
for the international climate change policy 
process, is largely silent about the potential 
for population policy to reduce risks from 
global warming. Though the latest IPCC re- 
port (2) includes an assessment of techni- 
cal aspects of ways in which population and 
climate change influence each other, the 
assessment does not extend to population 
policy as part of a wide range of potential 
adaptation and mitigation responses. We 
suggest that four misperceptions by many 
in the climate change community play a 
substantial role in neglect of this topic, and 
propose remedies for the IPCC as it pre- 
pares for the sixth cycle of its multiyear as- 
sessment process. 

Population-related policies—such as of- 
fering voluntary family planning services 
as well as improved education for women 
and girls—can have many of the desirable 
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characteristics of climate response options: 
benefits to both mitigation and adaptation, 
co-benefits with human well-being and 
other environmental issues, synergies with 
Sustainable Development Goals (SDGs), 
and cost effectiveness. These policies can 
also enable women to achieve their desired 
family size, and lead to lower fertility and 
slower population growth (3). The resulting 
demographic changes can not only lessen 
the emissions that drive climate change but 
also improve the ability of populations to 
adapt to its consequences. 


MISPERCEPTION 1: POPULATION GROWTH 
IS NO LONGER A PROBLEM 
The population growth rate of the develop- 
ing world increased sharply in the 1950s and 
1960s, resulting in a doubling of the world 
population from 3 billion in 1960 to over 6 
billion in 2000 (4). The main cause of this 
acceleration was the spread of public health 
measures, which rapidly reduced death rates 
while birth rates remained high. Slowing this 
population expansion became a top priority 
for the global development agenda. In the 
1970s and 1980s, substantial international 
assistance was invested in voluntary family 
planning programs to reduce fertility. 

This early consensus on population 
policy ended in the 1990s when interest 
and international support waned for rea- 
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Acommunity health worker shows women 
how to use a condom in Bangladesh. 


sons including (i) the belief that fertility 
declines already under way in Asia and 
Latin America would soon occur in Africa; 
(ii) the expectation that high AIDS mortal- 
ity would halt population growth in sub- 
Saharan Africa; (iii) the failure of earlier 
dire predictions, such as worldwide famine, 
to materialize; and (iv) the call from the 
1994 International Conference on Popula- 
tion and Development (ICPD) to empha- 
size reproductive health and rights over 
demographic aims. As a result, funding for 
reproductive health issues (e.g., maternal 
care, safe delivery, sexually transmitted dis- 
eases, and female genital cutting) rose and 
funding for family planning programs de- 
clined. These issues were widely debated in 
the 1990s and 2000s and may have affected 
the coverage of population policy in IPCC 
reports in part because population growth 
no longer seemed an urgent matter. 

Over the past decade, two unexpected 
developments have led to renewed concern 
about future population growth, particularly 
in sub-Saharan Africa. Fortunately, AIDS 
mortality has dropped sharply as treatment 
has become more accessible worldwide. In 
addition, and contrary to expectations, birth 
rates across sub-Saharan Africa have re- 
mained high, and declines in birth rates have 
stalled in several countries. As a result, the 
latest UN world population projection is the 
highest ever, expecting 11.2 billion in 2100, a 
nearly 4 billion rise from 2015 (4). Much of 
this rise is projected in sub-Saharan Africa 
(from 1 billion in 2015 to 4 billion in 2100), but 
Asia (excluding East Asia) and Latin America 
are also projected to grow substantially. 


MISPERCEPTION 2: POPULATION POLICIES 
ARE NOT EFFECTIVE 
Population policies generally recommend a 
range of interventions that influence fertil- 
ity trends, either directly or indirectly. Fam- 
ily planning programs to assist women in 
achieving their reproductive goals for lim- 
iting or spacing births have been the main 
population intervention adopted by gov- 
ernments (5). These programs have been 
successful in a number of countries, but 
further investments are still needed. Each 
year, about 85 million unintended pregnan- 
cies result in 32 million unplanned births 
worldwide; the large majority of these (28 
million) in the developing world (6). Popula- 
tion growth could be reduced substantially 
by avoiding these unplanned pregnancies. 
The vast majority of unintended preg- 
nancies occur among women who want to 
avoid pregnancy but are not using effective 
contraception. Reasons for non-use include 
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lack of access to services and the high costs 
of modern methods. Fear of side effects of 
methods, disapproval of partners, and re- 
luctance to violate social norms are also 
substantial barriers to use (7). 

Voluntary family planning programs de- 
signed to be responsive to cultural customs 
reduce these obstacles by increasing access, 
providing subsidies, and expanding method 
options. Well-run voluntary programs have 
contributed to sustained declines in fertil- 
ity and population growth across Asia, the 
Middle East, and Latin America and in 
some countries in Africa (5, 7). 

Although many studies have assessed 
outcomes of family planning programs, 
these mostly examined near-term effects 
on raising contraceptive use and reduc- 
ing birth rates. Very few studies estimate 
impacts of family planning programs on 
longer-term demographic trends, in part 
because isolating the exact impact of pro- 
grams versus socioeconomic development 
and other factors is not straightforward 
in most countries. The potential impact of 
these programs on long-range population 
size is illustrated by comparing Bangladesh 
and Pakistan, which had almost the same 
population size in 1980. Bangladesh then 
implemented one of the world’s most effec- 
tive voluntary family planning programs. 
By contrast, Pakistan’s program was con- 
siderably weaker, lacking government com- 
mitment. As a result, fertility trajectories 
differed substantially from 1980 onward, 
resulting in increasingly large differences in 
population size over time (see the figure). 
By 2100, Pakistan’s population is projected 
to be double the size of Bangladesh’s. This 
suggests that the Bangladesh family plan- 
ning program led to a large cut in the coun- 
try’s potential 2100 population. Fertility and 
population trends are also affected by levels 
of socioeconomic development, but this is 
unlikely to be the dominant explanation 
for the different trajectories (see supple- 
mentary materials). Development levels as 
measured by the Human Development In- 
dex are nearly the same for Bangladesh and 
Pakistan (8), which are both poor majority- 
Muslim countries in South Asia. 

High-quality voluntary family planning 
programs can also have a large impact in 
Sub-Saharan Africa. Programs implemented 
in the early 2000s in Ethiopia, Malawi, and 
Rwanda have already resulted in sharp de- 
clines in fertility. Unfortunately, in many 
African countries, including Nigeria, family 
planning is still given low priority. Existing 
programs in Asia and Latin America also 
could be improved. 

In addition to improving the health and 
welfare of women, families, and communi- 
ties, reduced fertility assists in eliminat- 
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ing poverty and reducing pressure on the 
natural environment. Family planning 
programs are one of the most cost-effective 
health and development investments avail- 
able to governments (9). 

Population policies often include other 
relevant programmatic interventions such 
as education and empowerment of women. 
These are not only important development 
interventions, but they also accelerate fer- 
tility decline (/0). Of course, educated and 
empowered women must have access to 
contraception to regulate their fertility. 


MISPERCEPTION 3: POPULATION DOES NOT 
MATTER MUCH FOR CLIMATE 

The emissions and land use that drive climate 
change are a result of income and consump- 
tion growth, technological change, changes 
in economic structure, related policies, and 
other factors. Past and current emissions 
have been attributed primarily to economic 
growth powered by fossil fuels in the cur- 
rently high-income countries. However, mul- 
tiple studies using increasingly sophisticated 
methods have demonstrated that population 
plays an important role both in historical 
and projected future emissions (17). Although 
slower future population growth would 
not be the most important means of reduc- 
ing future emissions, it could reduce global 
emissions by 40% or more in the long term 
(12). Slower population growth and associ- 
ated changes in age structure can also have 
positive economic effects that would tend to 
drive greenhouse gas emissions up, but these 
effects do not appear to be large enough to 
offset the emissions reduction produced by 
the slower population growth (see supple- 
mentary materials). 

The potential emissions reduction is 
large even though policy-induced declines 
in population growth would be largest in 
regions that currently have low per-capita 
emissions. Anticipated future growth in in- 
comes and energy use in these developing 
regions explain this result. Over the next 
few decades, overall emissions from low- 
income countries are likely to rise because 
of a rise in emissions per capita from rapid 
industrialization, as well as because of in- 
creasing population. 

Slower population growth is also an- 
ticipated to reduce climate change risks 
by freeing up resources for adaptation. Im- 
provements in education and health, which 
can both lead to and result from slower 
population growth, can reduce vulnerability 
to natural disasters and climate risks (13). 
Population-related policies that affect the 
spatial distribution of population, including 
those influencing immigration, labor mobil- 
ity, urbanization, and coastal development, 
can also affect vulnerability. 
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The IPCC itself has partially assessed 
the topic, concluding that population 
growth, urbanization, and changes in 
age structure are important drivers of 
emissions. It has also concluded that 
demography shapes the exposure and 
vulnerability of populations to climate 
impacts and can limit, or facilitate, the 
ability of society to adapt to those im- 
pacts. What is missing is an account- 
ing of how reductions in population 
growth might play a role in responses 
to the climate issue (see supplemen- 
tary materials). 

Many in the climate policy commu- 
nity currently focus on achieving sub- 


Population estimates and projections 
for Bangladesh and Pakistan 

Differences suggest that a good family planning program (as 

in Bangladesh beginning in the 1980s) can have a large impact 
on population trajectories in the long term. Data are from (4). 
See supplementary materials for details on data and methods. 
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of the literature on mitigation and ad- 
aptation options. Although the outline 
for the sixth IPCC assessment report 
has already been agreed upon (with no 
explicit mention of population policy), 
there is ample opportunity within its 
structure to assess literature on popu- 
lation policy as a component of mitiga- 
tion or adaptation responses, as well as 
its costs and benefits, implementation 
barriers, and links to SDGs (see supple- 
mentary materials). The IPCC should 
also consider the inclusion of more so- 
cial scientists experienced in reproduc- 
tive health and population policy. 
Beyond the IPCC, the climate and 


stantial emissions reductions in the 
near future. Although slowed popula- 
tion growth would contribute only mod- 
estly in the short term, its cumulative 
effect over the 21st century would be sub- 
stantial. Slowed population growth would 
reduce emissions and the demand for energy 
that would have to be satisfied with low- or 
zero-carbon sources. It would therefore also 
have an important effect on the scale of the 
energy system ultimately required under a 
stabilized climate. 


MISPERCEPTION 4: POPULATION POLICY IS 
TOO CONTROVERSIAL TO SUCCEED 

Family planning programs have attracted 
criticism since their initiation. The most 
persistent opposition has come from con- 
servative religious and social groups. One 
of their main concerns is that making con- 
traception readily available encourages 
promiscuity and leads to a breakdown of 
family life. In addition, the Catholic Church 
opposes artificial contraception, and Islam 
opposes sterilization. These religious con- 
cerns are shared by conservative political 
allies, leading to frequent controversy. 

Other concerns about family planning 
programs have been raised by human rights 
advocates who fear coercion. They point to 
examples of massive abuses by the Chinese 
government during the implementation of 
the one-child policy and by the Indian gov- 
ernment during an emergency period in the 
late 1970s. These abhorrent practices were 
and are universally condemned. Neverthe- 
less, reproductive health remains a political 
issue in many countries, and constraints on 
women’s choices continue to exist (e.g., lim- 
ited choice of contraceptive methods or lack 
of services). 

A key point of sensitivity is that family 
planning programs largely aim to reduce fer- 
tility in the developing world while people 
in the developed world, which is primar- 
ily responsible for causing the climate to 
change, continue their excessive emission of 
greenhouse gases. At the same time, many 
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developed countries are also increasingly 
concerned about aging and decline of their 
populations. Many in the climate change 
community believe that entering into a popu- 
lation policy discussion thus blames the poor 
countries for problems created by the rich 
countries. Although this belief is real, it does 
not change the fact that population growth 
in developing countries poses challenges for 
climate and development and deprives the 
international community of an important 
policy lever to improve human welfare. 

Although these controversies do indeed 
exist, they are not the obstacles to program 
implementation that some in the climate 
community believe them to be. Govern- 
ments around the world now support the 
conclusions of the ICPD, confirmed by 
the SDGs, which call for a human rights- 
based approach and for women everywhere 
to have the right to freely choose when 
and how often to get pregnant (J4). Many 
countries in Asia and Latin America have 
invested in family planning programs, and 
increasing numbers of governments in sub- 
Saharan Africa have started such programs. 
There is widespread agreement among gov- 
ernments and international organizations 
that family planning programs are a valu- 
able investment. The SDGs in fact call for 
more such services. But these programs are 
often given low priority because they are 
considered a health investment rather than 
an investment with wide-ranging socioeco- 
nomic and environmental benefits. 


POLICY LEVERS 

Rapid population growth is one of the key 
drivers of emissions and one of the determi- 
nants of vulnerability to impacts; it therefore 
should be considered as a potential climate 
policy lever. A key first step in remedying the 
current neglect of the issue is for the IPCC 
to include population policy in its assessment 
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environmental communities and in- 
ternational development institutions 
should embrace scientifically sound 
analyses of population policy and hu- 
man rights-based reproductive health pro- 
grams. Other international environmental 
assessments (11, 15) have done somewhat 
better than the IPCC in covering this topic. 
Given the urgency of addressing climate 
change, all available options, especially those 
that have multiple benefits for sustainable 
development, should be assessed by experts 
and considered by governments. 
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An island on the brink 


Convinced erosion, not climate change, threatens their 
home, a community grapples with an uncertain future 


By Andrew J. Miller 


angier Island, located in the middle 
of Chesapeake Bay, is one of the 
mainstays of the region’s blue crab 
fishery. No location on the small is- 
land is more than 5 feet above sea 
level, and two-thirds of its land 
mass has been lost since 1850. Much of 
its upland is being converted 
to tidal marsh as_ sea-level 
rise accelerates while the land 
subsides at one of the highest 
rates in the mid-Atlantic re- 
gion. In 2015, Schulte et al. ar- 
gued that without government 
action, the citizens of Tangier 
Island “may become among 
the first climate change refu- 


CHESAPEAKE 
REQUIEM 


Chesapeake Requiem 


between December 2015 and October 2017, 
documenting how the island and its peo- 
ple were responding to climate change. 
This book is a chronicle of what he learned 
about the lives of Tangier Islanders and 
about their history, culture, and beliefs, 
set within the context of what is known 
about the likely fate of this vulnerable, iso- 
lated island. 

Swift paints vivid portraits of 
both the natural environment 
and the individuals and insti- 
tutions of this close-knit com- 
munity, now fewer than 500 in 
number. Most residents depend 
on the fishery for their liveli- 
hood. Many share an ancestry 
on the island going back gen- 
erations, and most share a deep 


: , Earl Swift fe : : 
gees in the continental USA” Dey Street Books, Christian faith, a strict moral 
(1). It is anticipated that the 2018. 448 pp. code, and a commitment to hard 


island may become uninhab- 
itable within 50 years and, given more 
recent updates in anticipated rates of 
sea-level rise, perhaps within 25 years. 
Earl Swift picks up this theme in Chesa- 
peake Requiem, warning that the very wa- 
ters on which Tangier Islanders depend 
represent a threat to the community’s fu- 
ture. Swift spent a great deal of time there 


The reviewer is at the Department of Geography and 
Environmental Systems, University of Maryland, Baltimore 
County, Baltimore, MD 21250, USA. Email: miller@®umbc.edu 
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work in a challenging environ- 
ment. As they observe rapid changes in the 
landscape around them, the islanders fear 
the loss of the land they call home, and 
they ask and pray for both government 
assistance and divine intervention to fore- 
stall what most scientists regard as virtu- 
ally inevitable. 

Swift’s account of his time on the is- 
land is roughly chronological, inter- 
spersed with passages recounting the 
island’s history and sections that cite the 
scientific literature about the ongoing 
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The waters on which many residents depend for crab 
fishing may soon render Tangier Island uninhabitable. 


geologic evolution of the bay, predicted 
trends in sea-level rise, and the life cycle 
of the blue crab. He demonstrates a deep 
affection and admiration for the Tangier 
Islanders, who are often cantankerous 
but whom he also describes as “warm, 
loving, generous” and “remarkably resil- 
ient, hardworking, and courageous.” As 
he illustrates in one of the more harrow- 
ing and moving episodes recounted in the 
book, they are even willing to risk their 
lives for one another. 

One of the enduring mysteries docu- 
mented in this book is how social and 
religious attitudes on Tangier Island and 
elsewhere have become so entwined with 
an overt refusal to believe the scientific ev- 
idence in support of the rapidly accelerat- 
ing human impact on climate and sea-level 
rise. As Swift reveals, Tangier Islanders 
understand the effect of wind and waves 
on erosion but steadfastly maintain that 
human-induced climate change and sea- 
level rise are not real. Those he interviews 
believe that a government-funded seawall 
would save their island. 

Swift clearly does believe the science. He 
points out that thousands of coastal com- 
munities in the United States (and many 
more around the world) either are or will 
soon be at risk and that it is both physi- 
cally and economically impossible to save 
them all. He suggests that how we choose 
to respond to Tangier Island will say much 
about what we hold important. 

Recent news reports suggest that Tang- 
ier Island may be in line for a government- 
funded coastal protection project (2). The 
previous administration awarded funding 
to relocate the “climate refugees” of Isle 
de Jean Charles in Louisiana (3). Looking 
to the future, these types of decisions will 
need to be based on an understanding of 
the rate and consequences of sea-level rise 
and of the cumulative costs and trade-offs 
that are involved when multiple communi- 
ties are threatened (4). 

This book offers a well-rounded portrait 
of a rural community both dependent on 
and threatened by its natural environ- 
ment—and a description of a problem that 
unfortunately will become commonplace 
over the coming decades. 
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Our looming lead problem 


Neglect, poor planning, and bad decisions led to Flint’s 
water crisis. It could easily happen again. 


By Frederick Rowe Davis 


n 25 April 2014, Flint Mayor Dayne 

Walling ceremoniously shut off a 

valve to the Detroit water supply 

and opened the flow of the Flint 

River into local homes and _ busi- 

nesses. He marked the occasion by 
drinking from a glass filled with water 
from the new source. Eighteen months 
later, the water supply was switched back 
to Detroit. What occurred in between—the 
city’s failure to control infrastructure cor- 
rosion, the deterioration of fresh water en- 
tering Flint residences, citizen complaints, 
government denials, elevated lead levels 
in children, and public outcry—would 
become the basis for a crisis that rose to 
national attention in 2015. What the Eyes 
Don’t See by Mona Hanna-Attisha and The 
Poisoned City by Anna Clark are the first 
two books to document in detail this hor- 
rible disaster. 

In What the Eyes Don’t See, the Flint 
story unfolds from the strikingly personal 
perspective of Hanna-Attisha, the pediatri- 
cian and activist who documented elevated 
blood lead levels in the city’s children. Born 
in England to Iraqi immigrants, Hanna- 
Attisha moved with her family to Flint 
when she was a child. Her brother, an at- 
torney, has represented whistle-blowers 
in corporate fraud cases. A distant cousin 
was the first bacteriologist of Middle East- 
ern descent to work in the United States. A 
friend from high school became an environ- 
mental engineer at the U.S. Environmental 
Protection Agency. These relationships, we 
learn, bolster Hanna-Attisha’s commitment 
to her community. 

Yet, “Dr. Mona,” as she is known, never 
strays too far from the stories of her young 
patients in Flint, whom she refers to as 
her “Flint kids.” With names changed for 
privacy, the children’s stories emphasize 
the importance of tap water to daily life 
and the long-term risks associated with 
lead exposure. 

Hanna-Attisha’s writing style is direct 
and passionate. Her descriptions of the 
events that unfolded and the people in- 


The reviewer is at the Department of History, 
Purdue University, West Lafayette, IN 47907 USA. 
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volved reverberate with candor. She pulls 
no punches, directing some of her stron- 
gest criticism at Charles Kettering, whom 
she calls “Public Health Enemy #1.” 

Head of research at General Motors 
(GM) from 1920 to 1947, Kettering was the 
driving force behind the world’s embrace 
of leaded gasoline during the mid-20th 
century. Despite known alternatives, Ket- 
tering promoted tetraethyl lead, knowing 
GM could patent the additive. Ironically, 
and despite his role in the history of lead 
poisoning, there are tributes to Kettering 
all over Flint. 

If Kettering is the villain, Alice Hamil- 
ton is Hanna-Attisha’s heroine. A physi- 
cian and research scientist during the early 
20th century, Hamilton fought the use of 
lead in gasoline and advocated for safety 


What the Eyes Don’t See 
A Story of Crisis, 
Resistance, and Hope in 
an American City 

Mona Hanna-Attisha 
One World, 2018. 

378 pp. 


The Poisoned City 
Flint’s Water and the 
American Urban Tragedy 
Anna Clark 
Metropolitan Books, 
2018. 318 pp. 
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Lead is ubiquitous in American infrastructure. 
Minute amounts in water can accumulate to toxic 
levels, leading to cognitive deficits. 
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standards that would protect workers ex- 
posed to it from its effects. Hamilton’s 
passionate fight clearly inspired Hanna- 
Attisha, who used ZIP codes to demon- 
strate that children living in areas that 
depended on water from the Flint river ex- 
hibited elevated blood lead levels. 

The Poisoned City centers the story on 
Flint families and their advocacy on behalf 
of their children. Clark notes that Flint’s 
water crisis did not result from corporate 
malfeasance nor natural disaster. Rather, 
she argues, it arose as a direct result of 
neglect (at times willful), poor planning, 
and bad decision-making (the latter in- 
cluding city officials’ decision to violate 
the Safe Drinking Water Act, followed by 
months of delay and cover-up at all levels 
of government). 

But the story of Flint is also one of re- 
silience. Residents rose up and challenged 
the indifference and false palliatives of- 
fered by the authorities, who for many 
months proclaimed that the water supply 
in Flint was safe for drinking. 

Clark provides context to the Flint di- 
saster with informative sections on race in 
America, the rise and fall of cities, and en- 
vironmental justice. She details events and 
developments as recent as the spring of 
2018, including positive signs of reinvest- 
ment in Flint’s social and economic infra- 
structure. (Hanna-Attisha provides similar 
details, often filtered through the experi- 
ences of immigrants to America.) 

Both The Poisoned City and What the 
Eyes Don't See offer day-by-day (at times 
moment-by-moment) accounts of the inex- 
orable development of the conditions that 
led to the Flint disaster. If Hollywood were 
to produce a film inspired by the crisis, 
one can imagine scenes being interspersed 
with graphics revealing specific dates and 
times. Both books would have benefited 
from a similar device, if only to underscore 
how quickly the crisis emerged. 

The water crisis in Flint is profoundly 
worrisome: Numerous children suffered 
lead poisoning as a direct result of a bu- 
reaucratic focus on the fiscal rather than 
the social. With the huge amount of lead 
incorporated into the nation’s infrastruc- 
ture, many other communities are just a 
few poor decisions from a similar fate. 

Double reviews prompt the question of 
which book to read. My recommendation 
is to read both. What the Eyes Don’t See 
and The Poisoned City reverberate with 
crucial details, profound insights, and in- 
spiring conclusions. This is a rare oppor- 
tunity to read two superb—yet strikingly 
distinct—accounts of a national tragedy. & 
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Harassment charges: 
Enough himpathy 


We are well into the #Metoo era, yet journal- 
ists and editors are still fixated on the 
harasser’s fall from grace rather than the 
detrimental effect of sexual harassment 
on the victims and our society as a whole. 
The News story “Prominent geneticist out 
at UC Irvine after harassment finding” 
(M. Wadman, 29 June, https://scim.ag/ 
AyalaResignation) reinforces a familiar 
toxic narrative: The accomplishments of 
the harasser hold more value to science 
than women’s right to a safe workplace. 
This is now so commonplace that it has 
been dubbed “himpathy” (1). 

In the News story, Wadman tells us all 
about the “eminent” professor, from his 
scientific accomplishments to his personal 
hobbies. He did “pioneering” and “ground- 
breaking” work, he donated money to the 
university, and he was president of AAAS 
(the publisher of Science). However, we do 
not hear about the pioneering work of the 
women he harassed at University of Califor- 
nia, Irvine (UCI). From a graduate student 
to a tenured professor to an assistant dean, 
the News story reduced the women who 
demanded an end to his misconduct to 
complainers. We are told that Ayala was just 
being “European” and his actions were mis- 
understood; instead, the narrative should 
focus on the many women and careers that 
suffered from Ayala’s actions. Wadman then 
chose to end the article by quoting an Ayala 
supporter who diminished the investigation. 
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The same himpathy sentiments return in 
the follow-up News In Depth story “Report 
details harassment by famed biologist” (M. 
Wadman, 27 July, p. 316). Words matter, 
and Science should wield its words and 
influence carefully. It is time to recognize 
that harassers have taken a substantial toll 
on the advancement of science. It is time to 
acknowledge that sexual harassment in all 
its nefarious forms puts an unquantifiable 
burden on the victims (many of whom are 
our colleagues). It’s time to believe women. 


Jane Zelikova, Kelly Ramirez,* 
Jewel Lipps, on behalf of 500 Women 


Scientists leadership 

500 Women Scientists, Boulder, CO 80303, USA. 
*Corresponding author. 

Email: k.ramirez@nioo.knaw.nl 
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Harassment charges: 
Journalists’ role 


As a woman in science, I find the report- 
ing of allegations against Francisco Ayala 
(“#Metoo complaints fell noted geneticist,” 
M. Wadman, News In Brief, 6 July, p. 8) 
deeply troubling. The title implicates the 
“complaints” or “complainers” as respon- 
sible for the resignation, as opposed to 
the actions of the accused or the impartial 
sexual harassment investigation. The term 
“complaint” gives the impression that the 
alleged victims have minor grievances as 
opposed to serious reports of scientific 
misconduct. Not only is this title biased 
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UCI decided to rernove 
Francisco Ayala’s 

name from the science 
library after he 
resigned amid sexual 
harassment allegations. 


against whistleblowers, it is also factu- 
ally incorrect, as the first allegations of 
harassment were allegedly made 3 years 
before the resignation. I assert that the 
investigation, not the allegations, caused 
the accused to resign. 

Jessica Duffy 

Department of Biology, Midwestern State University, 


Wichita Falls, TX 76308, USA. 
Email: jlduffyl009@my.mwsu.edu 
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Harassment charges: 
Injustice done? 


We are deeply concerned by the way in 
which our friend and colleague Professor 
Francisco Ayala has been forced to resign 
from the University of California, Irvine 
(UCD, after being accused of sexual harass- 
ment (“Prominent geneticist out at UC 
Irvine after harassment finding,” M. 
Wadman, News, 29 June, https://scim.ag/ 
AyalaResignation). The charges that have 
been raised against him have had appall- 
ing consequences. 

Those of us who are well acquainted 
with Professor Ayala know that he is an 
honorable person, who throughout his 
career has treated his friends, co-workers, 
and students in a respectful, egalitar- 
ian way. His lifelong commitment to 
teaching, research, and outreach on bio- 
logical evolution has won him worldwide 
recognition. He has been a generous bene- 
factor to the University of California and 
throughout his fruitful career has opened 
new fields of biological research, pro- 
moted mutual respect and independence 
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between evolutionary studies and religious 
perspectives, played a key role in several 
major scientific organizations, and helped 
many Spanish-speaking female scholars and 
Hispanic students, in particular, both in the 
United States and throughout the world. 
From the available information, it 
appears that the inquiry conducted by 
UCI lacked genuine due process, fair- 
ness, and full transparency. We urge UCI 
to acknowledge the possibility that its 
sanctions against Professor Ayala were 
enacted in haste and to reopen the case 
and investigate the matter more thor- 
oughly. We understand the wish of both 
the institution and Professor Ayala not 
to unduly prolong this whole unhappy 
episode. It is equally important, however, 
that justice be done and be seen to be 
done. If carried out properly, UCI could 
help devise a more successful model for 
how institutions should deal with such 
situations in the future. Devising an 
improved procedure for these cases would 
earn everyone’s gratitude. 
Andrés Moya and 62 additional authors* 
Institute of Integrative Systems Biology, University 
of Valencia, 46010 Valencia, Spain, and Foundation 
for the Promotion of Health and Biomedical 
Research of Valéncia Region (FISABIO), 46020 


Valéncia, Spain. Email: andres. moya@uv.es 
*The full list of authors is available online. 


SUPPLEMENTARY MATERIALS 


Full list of authors 
www.sciencemag.org/content/361/6403/655.3/supp|/DC1 


10.1126/science.aau7088 


Harassment charges: 
Metoo but due process 


The resignation of eminent biologist 
Francisco J. Ayala amid charges of sexual 
harassment (“Prominent geneticist out at 
UC Irvine after harassment finding,” M. 
Wadman, News, 29 June, https://scim.ag/ 
AyalaResignation) has left Ayala’s home 
campus deeply divided. The controversy 
highlights problems in the way universities 
currently address charges of sexual harass- 
ment. Because the University of California, 
Irvine (UCI), failed to post easily acces- 
sible guidelines on how the Ayala case was 
handled, especially how and by whom final 
decisions were made, many UCI faculty are 
concerned that the university overreached, 
imposing a punishment not commensurate 
with the specific charges of wrongdoing. A 
zero-tolerance policy for sexual harassment 
should include clearly stated procedures 
to protect due process and ensure propor- 
tional responses to wrongdoing. 

I will not engage in the ugly nastiness 
of “he said, she said” as we debate the 
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veracity of accusations described in sala- 
cious detail (“Report details harassment 
by famed biologist,’ M. Wadman, News In 
Depth, 27 July, p. 316). These are human 
tragedies evolving in front of us. Victims 
strive to regain lost self-esteem, along with 
justice. The accused wrestle with shock, 
forced to confront their own cluelessness at 
shifting mores, and everyone realizes how 
vulnerable we all are, including adminis- 
trators struggling to figure out how best 

to fairly confront sexual harassment and 
gender equality. We are all at risk when 
goodwill and communications break down. 

To fairly and equitably strike a balance 
between complacency and overreaction, 

I recommend three procedures to help 
achieve the notoriously difficult goal of 
changing attitudes toward gender. First, 
we must do more to foster a university 
climate that takes seriously and protects 
potential targets, be they male or female. 
Part of that is recognizing that even subtle 
forms of verbal behavior—i.e., jokes or 
comments about appearance—make some 
women and minorities uncomfortable. 
Yet resulting deferential treatment can 
contribute to a climate in which women 
or minorities are treated differently. Even- 
tually, this subtle, differential treatment 
can foster continuing inequality. Second, 
we must insist that legally correct poli- 
cies and procedures are followed when 
charges of sexual harassment are made. 
These procedures need to be transpar- 
ent and the same for all faculty. Policies 
should be written in prose that even 
nonlawyers can understand and posted 

in obvious places, with regularly sched- 
uled discussion forums designed to help 
educate all members of the university 
community. Third, the punishment must 
fit the crime. If tasteless, off-color jokes 
and the kind of ambiguous “unwanted 
touching” of which Ayala was accused 
warrant his public humiliation, what do 
we do with more serious charges of sexual 
harassment? And why is Hana Ayala pun- 
ished for her husband’s acts by having her 
name removed from gifts to UCI? 

The #Metoo movement has done a great 
service in opening up an area too long 
taboo. But the failure to follow clearly es- 
tablished, fairly administered, and trans- 
parent procedures can too easily produce 
witch hunts that cast doubts on legitimate 
charges of sexual harassment. This will set 
back the move toward gender equality, in 
the academy and in society at large. 
Kristen Renwick Monroe 
Department of Political Science, University of 


California, Irvine, Irvine, CA 92697, USA. 
Email: krmonroe@uci.edu 
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When fewer workers 
are more efficient 


narrow passageway can easily 
become clogged or jammed if 
too much traffic tries to enter 
at once or there is competition 
between the flow of traffic in 
each direction. Aguilar et al. studied 
the collective excavation observed 
when ants build their nests. Because 
of the unequal workload distribu- 
tion, the optimal excavation rate 
is achieved when a part of the ant 
collective is inactive. Numerical 
simulations and the behavior of 
robotic ants mimic the behavior of 
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Pest subverts host 
plant’s foraging 

Plants need iron as a micronutri- 
ent, and they extract it from the 
rhizosphere by secreting chelat- 
ing agents. Insect pests, such 

as the western corn rootworm, 
which annually cause millions of 
dollars’ worth of lost yield, need 
iron, too. Hu et al. show that the 
rootworm exploits the plant's 
own iron-foraging system to 
detect its host and to seize iron 
for itself (See the Perspective by 
Kliebenstein). Plants produce 
benzoxazinoid compounds not 
only as a defense against many 


insects but also as iron chelators. 


Rootworm larvae are not harmed 
by benzoxazinoids; instead, they 
take advantage of their presence 
as a signal that food is near and 
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of their properties as an iron 
chelator. —PJH 
Science, this issue p. 668; 
see also p. 642 


Ashrinking marine 
refrigerator 


Low subtropical marine clouds 
scatter solar radiation back 

to space and thereby cool the 
climate system. Most work 

on understanding changes in 

the coverage of these types 

of clouds has focused on the 
effects of sea surface tempera- 
tures or on aerosols. Yuter et al. 
show that dynamic effects due 
to atmospheric gravity waves are 
responsible for the rapid clearing 
of large areas of these clouds. 
This phenomenon also has 


implications for marine ecology 
and biogeochemistry. —HJS 


Science, this issue p. 697 


Controlling two- 
dimensional twist 


In heterostructures assembled 
from two-dimensional materi- 
als such as graphene, electron 
tunneling between layers varies 
strongly with the rotation 

angle between the crystal lat- 
tices. Usually, the twist angle 
between layers is fixed after 
assembly. Ribeiro-Palau et al. 
encapsulated graphene with 
boron nitride, but the top boron 
nitride flake was shaped so that 
an atomic force microscope 

tip could push on it to vary the 
twist angle by as little as 0.2° 
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the colony. —MSL 


Science, this issue p. 672 


Ants optimize their excavation rate in 
narrow tunnels when only some are active. 


They observed variations with 
twist angle in properties such 
as the charge neutrality point, 
which would be difficult to 
observe in static rotated struc- 
tures. —PDS 


Science, this issue p. 690 


Liquid droplets 
step on the cGAS 


Spontaneous partitioning of 

a homogeneous solution of 
molecules, or liquid-phase 
separation, underlies the forma- 
tion of cellular bodies from P 
granules to nucleoli. Essentially, 
dense-phase liquid droplets 

act like cellular compartments. 
Du et al. show that DNA bind- 
ing to its cytoplasmic sensor, 
cyclic GMP-AMP synthase 
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(cGAS), results in liquid droplets 
containing activated cGAS (see 
the Perspective by Ablasser). 
This phenomenon occurs 
through multivalent interac- 
tions, augmented by zinc, 
between DNA binding domains 
on cGAS and DNA ina length- 
dependent manner. Binding 
triggers a switchlike reaction 
that concentrates the enzyme 
and reactants to enhance 
STING-dependent interferon 
responses. —STS 

Science, this issue p. 704; 

see also p. 646 


A tale of not-so-pure tails 
The poly(A) tail of mRNA has 
been thought to be a pure stretch 
of adenosine nucleotides with 
little informational content except 
for length. Lim et al. identified 
enzymes that can decorate 
poly(A) tails with non-A nucleo- 
tides. The noncanonical poly(A) 
polymerases, TENT4A and 
TENTA4B, incorporate intermittent 
non-A residues (G, U, or C) witha 
preference for guanosine, which 
results in a heterogenous poly(A) 
tail. Deadenylases trim poly(A) 
tails to initiate mRNA degradation 
but stall at the non-A residues. In 
effect, the not-so-pure tail sta- 
bilizes mRNAs by slowing down 
deadenylation. —BAP 


Science, this issue p. 701 


Can robots peer- 
pressure children? 


Most of us know the classic 
peer-pressure question: “If 
everyone jumped off a bridge, 
would you?” Vollmer et al. 
investigated whether robots can 
have the same effect on humans 
as our conspecifics do. Child or 
adult participants were asked to 
determine which of three lines 
on ascreen was the same as a 
reference line, but there was a 
secret twist: Three confederates 
seated with the participant all 
gave the same wrong answer. 
Adults were more likely to go 
along with the group's wrong 
answer when the confederates 
were all humans but resisted 
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the social pressure from a group 
of robots. Children, however, 
were susceptible to robot peer 
pressure, too, which raises the 
question of what safeguards are 
needed when young or vulner- 
able populations interact with 
automata. —RLK 

Sci. Robot. 3, eaat7111 (2018). 


Forests improve dietary 
diversity 
Micronutrient deficiency affects 
nearly two billion people world- 
wide and particularly affects 
children in developing countries. 
Analyzing data on children’s 
diets in more than 43,000 
households across 27 develop- 
ing countries, Rasolofoson et 
al. show that access to local 
forests, and the foods they pro- 
vide, directly improves dietary 
diversity and mitigates nutri- 
ent deficiencies comparably to 
nutrition-sensitive agricultural 
programs. The magnitude of 
this connection between nutri- 
tion and forests is greatest in 
communities with lower levels 
of development and implies that 
nutrition deficiencies in develop- 
ing communities may be driven 
by the loss or poor management 
of local forests. —AC 

Sci. Adv. 10.1126/sciadv.aat2853 

(2018). 


Laser-shocking 
deuterium into metal 


The conditions in which hydro- 
gen disassociates and becomes 
an atomic metal occur in high- 
energy-density environments, 
such as the interiors of giant 
planets and nuclear explo- 
sions. Celliers et al. trained 168 
lasers on deuterium samples 
at the National Ignition Facility 
to measure the pressure and 
temperature conditions of the 
hydrogen transition. Careful 
optical measurements led to the 
addition of four new points on 
the phase diagram, consistent 
with static estimates and theo- 
retical calculations. —BG 
Science, this issue p. 677 
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Viral genomes behind 
HIV remission 


HIV-positive patients are often 
dependent on continuous 
antiretroviral therapy (ART) to 
prevent AIDS progression (such 
patients are called noncon- 
trollers). A small proportion of 
individuals, called HIV post- 
treatment controllers, sustain 
HIV remission after receiving 
short-term ART. To understand 
the differences between these 
types of patients, Sharaf et al. 
sequenced HIV genomes from 
plasma samples. Before the 
interruption of ART, posttreat- 
ment controllers had a lower 
HIV genome reservoir size than 
noncontrollers by a factor of 7. 
Posttreatment controllers 

also had fewer defective HIV 
genome copy numbers and 
showed heightened HIV-specific 
immune responses and slower 
viral rebound after interrup- 
tion of ART. Analysis of HIV 
genome characteristics could 
provide important information 
for designing an individual's 
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optimal treatment plan. —MY 
J. Clin. Invest. 10.1172/JC1120549 
(2018). 


Detection by protection 
DNA aptamers are oligo- 
nucleotides that bind with high 
specificity to targets rang- 
ing from metal ions to whole 
proteins. Binding of the desired 
target can introduce structural 
changes in the DNA and thus 
signal the concentration of the 
target. Canoura et al. designed 
a DNA aptamer-—based bioas- 
say that can determine the 
concentration of multiple small 
molecules at once. If no target 
is present, enzymes degrade 
the DNA aptamer, which is 
unprotected on its own. Binding 
of the aptamer target locks the 
DNA ina stable conformation, 
protecting it from degradation 
and enabling readout through a 
second, labeled DNA. The assay 
was used to detect ATP and 
cocaine in a complex sample 
simultaneously. —MAF 

J.Am. Chem. Soc. 140, 9961 (2018). 
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Time is the key 
How does one construct an 
accurate time scale for ocean 
sediments? One way is to mea- 
sure their carbon-14 content 
and use that radioactive clock 
to date them. This approach 
has pitfalls, though, because 
the carbon fixed by organisms 
at the sea surface that is later 
locked up in sediments may 
itself have been old (i.e., the 
sediment clock may not have 
started at time zero). Zhao and 
Keigwin constructed a time 
scale for sediments from the 
eastern equatorial Pacific by 
dating twigs found within them, 
thereby eliminating the problem 
of correcting for surface reser- 
voir effects. Ventilation ages of 
the thermocline and intermedi- 
ate waters there during the Last 
Glacial Maximum and degla- 
ciation were similar to those 
of today, contradicting results 
from a number of previous stud- 
ies. —HJS 

Nat. Commun. 9, 3077 (2018). 


SCIENCE sciencemag.org 


fashion. —PRS 


Up through the 
atmosphere 

Surprisingly, a major component 
of the diet of swifts is spiders. 
Swifts only stop flying when 

they are breeding, so how do 

the spiders on which they feed 
become airborne? Morley and 
Robert discovered that spiders 
are sensitive to electric fields, 

so that they respond to the 
atmospheric potential gradient. 
The electric field mechanically 
activates hairs (trichobothria) on 
the spiders’ legs. Formation of a 
vertical electric field stimulates a 


From the concrete to the abstract 


arly processing in the brain tends to simply 
represent external information from the sensory 
organs. However, to behave appropriately, 
organisms need to structure this input into 
meaningful categories and concepts. Brincat 
et al. recorded from multiple regions along the 
processing hierarchy in monkey brains while the 
animals performed sophisticated categorization 
tasks. Lower-level areas conveyed strong informa- 
tion about sensory stimuli within their preferred 
domains but indicated hardly any abstraction 
beyond the raw sensory inputs. In contrast, the pre- 
frontal cortex, despite containing relatively weak 
information overall, showed strongly abstracted, 
task-relevant coding. Intermediate areas exhibited 
mixed representations, with partial categorical 
coding occurring in some domains but not others. 
Representational transformation thus happens 
gradually, across multiple cortical processing 
steps, rather than in a discrete, all-or-nothing 


Proc. Natl. Acad. Sci. U.S.A. 10.1073/pnas.1717075115 (2018). 


Monkeys use sophisticated categorization strategies— 
for example, while selecting what to eat. 


spider standing on top of a plant 
to release fine threads of silk and 
let go. The electric field strength 
of pointed leaves can achieve 
tens of kilowatts per meter in 
mildly unsettled weather, and 
the atmospheric potential gradi- 
ent provides enough coulomb 
force to lift a spider into the air 
without any help from passing 
breezes. The next thing to work 
out is what the advantages are to 
spiders to ascending sometimes 
thousands of meters into the 
atmosphere—it isn’t just to offer 
themselves as tidbits to swifts. 
—CA 

Curr. Biol. 28,2324 (2018). 


Spiders stand on prominences, emit strands of silk, and take to the air. 
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Overfed mice got | 
rhythm 


Metabolic health is intertwined 
with the body's circadian clock, 
but the mechanisms underly- 
ing this connection are not fully 
understood. Studying mice, 
Guan et al. examined the effect 
of diet-induced obesity on circa- 
dian expression of genes in the 
liver. They found that obesity 
intensifies and synchronizes 
the liver circadian rhythms 
of enhancers that control the 
expression of two transcrip- 
tion factors with opposing 
effects on lipid metabolism. The 
result is enhanced synthesis 
and oxidation of fatty acids. 
Drugs activating one of these 
transcription factors, PPARa, 
are already used clinically to 
reduce lipid levels. Interestingly, 
in obese mice, these drugs were 
most effective when delivered 
at peak PPARa expression, 
indicating that their efficacy in 
humans might be optimized by 
administration at certain times 
of day. —PAK 

Cell 174, 831 (2018). 


All together now 
Displacement reactions at 
saturated carbons often 
proceed by a concerted 
mechanism: A new bond forms 
at the same time as the old 
bond breaks. In aromatic rings, 
Meisenheimer complexes have 
been characterized with the 
incoming group bound before 
the leaving group departs, so 
a two-step mechanism is the 
default assumption. Kwan et al. 
now report that this assump- 
tion usually does not hold. 
They measured carbon kinetic 
isotope effects of several 
nucleophilic aromatic substitu- 
tions by using a sensitive °F 
nuclear magnetic resonance 
technique; they then bench- 
marked theoretical methods 
for a much broader virtual 
study. The net result was that 
83% of 120 simulated reac- 
tions were concerted. —JSY 
Nat. Chem. 10.1038/s41557-018- 
0079-7 (2018). 
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OPTICAL IMAGING 
More to imaging than 
meets the eye 


Traditional imaging techniques 
involve peering down a lens and 
collecting as much light from 
the target scene as possible. 
That requirement can set limits 
on what can be seen. Altmann 
et al. review some of the most 
recent developments in the 
field of computational imaging, 
including full three-dimensional 
imaging of scenes that are 
hidden from direct view (e.g., 
around a corner or behind 

an obstacle). High-resolution 
imaging can be achieved with a 
single-pixel detector at wave- 
engths for which no cameras 
currently exist. Such advances 
will lead to the development of 
cameras that can see through 
fog or inside the human body. 
—|SO 


Science, this issue p. 660 


WHEAT GENOME 
Insights from the 
annotated wheat genome 


Wheat is one of the major 
sources of food for much of the 
world. However, because bread 
wheat’s genome is a large hybrid 
mix of three separate subge- 
nomes, it has been difficult to 
produce a high-quality refer- 
ence sequence. Using recent 
advances in sequencing, the 
International Wheat Genome 
Sequencing Consortium presents 
an annotated reference genome 
with a detailed analysis of gene 
content among subgenomes and 
the structural organization for all 
the chromosomes. Examples of 
quantitative trait mapping and 
CRISPR-based genome modi- 
fication show the potential for 
using this genome in agricultural 
research and breeding. Ramirez- 
Gonzalez et al. exploited the 
fruits of this endeavor to identify 
tissue-specific biased gene 
expression and coexpression 
networks during development 
and exposure to stress. These 
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resources will accelerate our 
understanding of the genetic 


basis of bread wheat. —LMZ 
Science, this issue p. 662 


DEVELOPMENTAL 
SIGNALING 

How Wnt ligands achieve 
specificity 

Wnt signaling is essential for 
development, tissue homeo- 
stasis, and disease. The 19 
members of the Wnt family 
interact promiscuously with the 
10 Frizzled receptors, raising the 
question of how ligand-specific 
discrimination is achieved ina 
biological context. Eubelen et al. 
used experiments in zebrafish 
to show that cells are equipped 
with decoding modules that bind 
Wnt with high specificity and 
trigger signal amplification via 
their recruitment into higher- 
order Frizzled signalosomes 
(see the Perspective by Kim and 
Goentoro). Thus, distinct Wnt 
ligand-receptor pairs can be tar- 
geted specifically for therapeutic 
purposes. —BAP 


Science, this issue p. 663; 


see also p.643 


MEMBRANES 
Spraying makes it 
smoother 


Commercial reverse osmosis pro- 
cesses for water desalination use 
membranes made by the polym- 
erization of polyamide at the oil/ 
water interface. Chowdhury et 
al. show that thinner, smoother 
membranes can be made with 
an electrospray technique. Using 
high voltage, the two precursors 
are finely sprayed onto a sub- 
strate and polymerize on contact. 
The composition of the resulting 
membrane can be tuned on the 
basis of the proportion of the two 
components. At optimum condi- 
tions, the membranes appear 
to be better at desalination than 
current commercial reverse 
osmosis membranes. —MSL 
Science, this issue p. 682 
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PHOTOLUMINESCENCE 
Unmasking the glow of 
silver clusters 


Small silver clusters stabi- 
lized by organic materials or 
inorganic surfaces can exhibit 
bright photoluminescence, but 
the origin of this effect has been 
difficult to establish, in part 
because the materials are het- 
erogeneous and contain many 
larger but inactive clusters. 
Grandjean et al. studied silver 
clusters in zeolites, using x-ray 
excited optical luminescence 
to monitor only the emissive 
structures (see the Perspective 
by Quintanilla and Liz-Marzan). 
Aided by theoretical calcu- 
lations, they identified the 
electronic states of four-atom 
silver clusters bound with water 
molecules that produce bright 
green emission—thus identify- 
ing candidate materials for 
application in lighting, imaging, 
and therapeutics. —PDS 

Science, this issue p. 686; 

see also p.645 


ORGANIC CHEMISTRY 
Radically transforming 
light hydrocarbons 


The methane, ethane, and pro- 
pane in natural gas are mostly 
inert under ambient condi- 
tions. Mainly they are burned 
to produce heat. Hu et al. show 
that a simple cerium salt paired 
with an alcohol can catalyti- 
cally transform these and other 
simple hydrocarbons into 
reactive radicals at room tem- 
perature (see the Perspective 
by Kanai). The reactions rely 
on light to photolytically cleave 
cerium alkoxide bonds, produc- 
ing alkoxy radicals that strip H 
atoms from the hydrocarbons 
and regenerate the alcohol. The 
resultant alkyl radicals readily 
add to azo compounds, olefins, 
and aromatics. —JSY 

Science, this issue p. 668; 

see also p.647 
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ORGANIC CHEMISTRY 
Four varieties of carbony! 
sandwich 


Compounds with adjacent car- 
bons sandwiched between two 
carbonyl (C=O) centers turn up 
frequently in organic chemistry. 
When these central carbons 
each have a substituent, there 
are four possible mutual geom- 
etries, all with potentially distinct 
biochemical properties. Kaldre 
et al. present a single method to 
access each stereoisomer indi- 
vidually. The outcome depends 
on the straightforwardly 
tunable configuration of a 
sulfoxide group in a precursor, 
which guides a rearrangement. 
The versatility of the method 
should facilitate selective access 
to 1,4-dicarbony! motifs in phar- 
maceutical research. —JSY 
Science, this issue p. 664 


TRANSLATIONAL 
CONTROL 


Fragile X and fragile 
translation in flies 


Mutations in the fragile X mental 
retardation 1 (FMR1) gene under- 
ie fragile X syndrome and fragile 
X-associated primary ovarian 
insufficiency, which are promi- 
nent intellectual disability and 
reproductive disorders, respec- 
tively. FMR1 is thought to reduce 
protein synthesis (translation) at 
synapses. In Drosophila oocytes, 
Greenblatt and Spradling found 
that Fmr1 loss leads to oocytes 
that generate embryos exhib- 
iting neural defects (see the 
Perspective by Aryal and Klann). 
Ribosome profiling of oocytes 
identified a specific role for 
FMR1 in enhancing the transla- 
tion of large proteins, including 
many associated with autism. 
FMR1 seems to help maintain 
translation of large mRNAs that 
otherwise condense into inactive 
ribonucleoprotein particles. This 
mechanism may underlie other 
causes of autism and mental 
dysfunction. —BAP 
Science, this issue p. 709; 
see also p. 648 
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LIVER DISEASE 
Setting liver 
regeneration free 


The mammalian liver has 
considerable regenerative 
capacity. After acute severe 
injury, aS occurs in acetamino- 
phen (paracetamol or Tylenol) 
poisoning, liver regeneration 
and recovery may fail. Bird 

et al. show that senescence, 
which is classically associated 
with aging and carcinogenesis, 
prevents liver cell proliferation 
and reduces the liver’s regenera- 
tive capacity after acute injury. 
Senescence spreads from cell to 
cell through signaling by trans- 
forming growth factor—B (TGFB). 
When TGF signaling was 
blocked during acetaminophen 
poisoning in mice, senescence 
was impeded, regeneration 
accelerated, and mouse survival 
increased—thus presenting an 
attractive therapeutic approach 
for liver regeneration. -OMS 


Sci. Transl. Med. 10, eaanl1230 (2018). 


LYMPHATICS 
Setting the stage for 
attack 


There is growing recognition of 
the importance of stromal cells 
in shaping immune organs and 
immune responses. In classi- 

cal secondary lymphoid organs 
(SLOs) such as lymph nodes, 
tonsils, and Peyer's patches, 
fibroblastic reticular cells 
(FRCs) play an integral part in 
immune responses. Nonclassical 
SLOs, including fat-associated 
lymphoid clusters (FALCs), also 
have important roles in systemic 
immunity. Perez-Shibayama et 
al. report that FRCs in FALCs are 
both organizational and immu- 
nomodulatory. —AB 


Sci. Immunol.3, eaar4539 (2018). 


HORMONE SIGNALING 
How progesterone 


stimulates cilia 

The transport of eggs depends 
on the beating of the cilia 

of the cells lining the ovi- 

duct. Cilia beating is a Ca** 


SCIENCE sciencemag.org 


signaling—dependent process. 
Jung et al. investigated how the 
hormone progesterone affects 
Ca** signaling in mouse ciliated 
oviduct cells. The hormone 
increased intracellular Ca** 
concentrations in a process 
requiring the cation chan- 
nel TRPV4 and the stepwise 
activation of GABA, and GABA, 
receptors. These receptors 
associated with each other in 
response to progesterone and 
GABAergic agonists. —JFF 

Sci. Signal. 11, eaam6558 (2018). 
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Quantum-inspired 


computational imaging 


Yoann Altmann, Stephen McLaughlin, Miles J. Padgett, Vivek K Goyal, 


Alfred O. Hero, Daniele Faccio* 


BACKGROUND: Imaging technologies, which 
extend human vision capabilities, are such a 
natural part of our current everyday experience 
that we often take them for granted. However, 
the ability to capture images with new kinds of 
sensing devices that allow us to see more than 
what can be seen by the unaided eye has a rel- 
atively recent history. 

In the early 1800s, the first ever photograph 
was taken: an unassuming picture that required 
days of exposure to obtain a very grainy image. 
In the late 1800s, a photograph was used for 
the first time to see the movement of a running 
horse that the human eye alone could not see. 
In the following years, photography played a 
pivotal role in recording human history, rang- 
ing from influencing the creation of the first 
national parks in the United States all the way 
to documenting NASA’s Apollo 11 mission to 
put a man on the Moon. In the 1990s, roughly 


10 billion photographs were taken per year. 
Facilitated by the explosion in internet usage 
since the 2000s, this year we will approach 
2 trillion images per year—nearly 1000 images 
for every person on the planet. This upsurge is 
enabled by considerable advances in sensing 
and data storage and communication. At the 
same time, it is driving the desire for imaging 
technology that can further exceed the capabil- 
ities of human vision and incorporate higher- 
level aspects of visual processing. 


ADVANCES: Beyond consumer products, re- 
search labs are producing new forms of imag- 
ing that look quite different from anything 
we were used to and, in some cases, do not 
resemble cameras at all. 

Light is typically detected at relatively high 
intensities, in the spectral range and with frame 
rates comfortable to the human eye. However, 
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Quantum-based imaging systems are being developed to image through opaque media 
(e.g., fog or human tissue) that scatter light in all directions. 


Altmann et al., Science 361, 660 (2018) 
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emerging technologies are now relying on 
sensors that can detect just one single photon, 
the smallest quantum out of which light is 
made. These detectors provide a “click,” just 
like a Geiger detector that clicks in the pres- 
ence of radioactivity. We have now learned to 
use these “click” detectors to make cameras 
that have enhanced properties and applica- 
tions. For example, videos 
can be created at a tril- 
lion frames per second, 
making a billion-fold jump 
in speed with respect to 
standard high-speed cam- 
eras. These frame rates 
are sufficient, for example, to freeze light in 
motion in the same way that previous photo- 
graphy techniques were able to freeze the 
motion of a bullet—although light travels a 
billion times faster than a supersonic bullet. 
By fusing this high temporal resolution to- 
gether with single-photon sensitivity and ad- 
vanced computational analysis techniques, a 
new generation of imaging devices is emerging, 
together with an unprecedented technological 
leap forward and new imaging applications 
that were previously difficult to imagine. 
For example, full three-dimensional (3D) 
images can be taken of a scene that is hidden 
behind a wall, the location of a person or car 
can be precisely tracked from behind a corner, 
or images can be obtained from a few photons 
transmitted directly through an opaque mate- 
rial. Inspired by quantum techniques, it is also 
possible to create cameras that have just one 
pixel or that combine information from mul- 
tiple sensors, providing images with 3D and 
spectral information that was not otherwise 
possible to obtain. 


OUTLOOK: Quantum-inspired imaging tech- 
niques combined with computational approaches 
and artificial intelligence are changing our per- 
spective of what constitutes an image and what 
can or cannot be imaged. Steady progress is 
being made in the direction of building cam- 
eras that can see through fog or directly inside 
the human body with groundbreaking poten- 
tial for self-driving cars and medical diagnosis. 
Other cameras are being developed that can 
form 3D images from information with less 
than one photon per pixel. Single-photon cam- 
eras have already made their way into widely 
sold smartphones where they are currently used 
for more mundane purposes such as focusing 
the camera lens or detecting whether the phone 
is being held close to one’s ear. This technol- 
ogy is already out of the research laboratories 
and is on the way to delivering fascinating im- 
aging systems. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: daniele.faccio@glasgow.ac.uk 
Cite this article as Y. Altmann et al., Science 361, eaat2298 
(2018). DOI: 10.1126/science.aat2298 
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Quantum-inspired 


computational imaging 


Yoann Altmann’, Stephen McLaughlin’, Miles J. Padgett”, Vivek K Goyal’, 


Alfred O. Hero*, Daniele Faccio”* 


Computational imaging combines measurement and computational methods with the 
aim of forming images even when the measurement conditions are weak, few in number, 
or highly indirect. The recent surge in quantum-inspired imaging sensors, together with 

a new wave of algorithms allowing on-chip, scalable and robust data processing, has 
induced an increase of activity with notable results in the domain of low-light flux imaging 
and sensing. We provide an overview of the major challenges encountered in low-illumination 
(e.g., ultrafast) imaging and how these problems have recently been addressed for 
imaging applications in extreme conditions. These methods provide examples of the 
future imaging solutions to be developed, for which the best results are expected to arise 
from an efficient codesign of the sensors and data analysis tools. 


omputational imaging is the fusion of com- 

putational methods and imaging techniques 

with the aim of producing better images, 

where “better” has a multiplicity of mean- 

ings. The development of new imaging 
sensors and, in particular, instruments with single- 
photon sensitivity, combined with a new wave of 
computational algorithms, data handling capa- 
bility, and deep learning, has resulted in a surge 
of activity in this field. 

One clear trend is a shift away from increasing 
the number of megapixels and toward fusing 
camera data with computational processing and, 
if anything, decreasing the number of pixels, 
potentially to a single pixel. The incoming data 
may therefore not actually look like an image 
in the conventional sense but are transformed 
into one after a series of computational steps 
and/or modeling of how the light travels through 
the scene or the camera. This additional layer 
of computational processing frees us from the 
chains of conventional imaging techniques 
and removes many limitations in our imaging 
capability. 

We briefly describe some of the most recent 
developments in the field, including full three- 
dimensional (3D) imaging of scenes that are hid- 
den (e.g., around a corner or behind an obstacle), 
high-resolution imaging with a single-pixel de- 
tector at wavelengths for which no cameras exist, 
cameras that can see through fog or inside the 
human body, and cameras that mimic the hu- 
man eye by creating detail only in areas of in- 
terest. We will also discuss how multispectral 
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imaging with single-photon detectors can im- 
prove 3D reconstruction and provide richer in- 
formation about a scene. 

We discuss how single-photon detection tech- 
nologies are transforming imaging capabilities 
with single-photon-sensitive cameras that can 
take pictures at the lowest light levels and with 
the ability to create videos reaching a trillion 
frames per second. This improvement has en- 
abled the capture of images of light beams trav- 
eling across a scene and provided opportunities 
to observe image distortions and peculiar rela- 
tivistic temporal inversion effects that are due 
to the finite speed of light. The ability to cap- 
ture light in flight also underpins some of the 
applications mentioned above—for example, the 
ability to view a 3D scene from around a corner. 
Probabilistic modeling of the particlelike nature 


of light when using single-photon detectors has 
stimulated the birth of new computational tech- 
niques such as “first-photon imaging,” which 
hints at the ultimate limits of information to be 
gained from detecting just one photon. 


Single-pixel and ghost imaging 

Although most imaging techniques that have 
emerged recently are based on classical detec- 
tors and cameras, some of these approaches have 
been inspired by or have a tight connection with 
similar ideas in quantum imaging. A prime ex- 
ample is ghost imaging (Fig. 1) (J), originally 
thought to be based purely on quantum princi- 
ples but now recognized as being dependent 
on spatial correlations that can arise from both 
quantum and classical light (2). The realization 
that this technique does not require quantum 
light led to a merging of the fields of computa- 
tional ghost imaging (3) and work on single-pixel 
cameras (4), as well as to an overall increase of 
activity in the field. In its quantum version, ghost 
imaging refers to the use of parametric down- 
conversion to create pairs of photons with corre- 
lated positions. If we detect the position of one 
photon with a standard camera and illuminate 
an object with the other position-correlated pho- 
ton, it is sufficient to detect only the reflectance 
or transmittance of the object with a single-pixel 
detector—i.e., to measure the correlation count 
between the beams to then reconstruct a full 
image by repeating the measurement with many 
different photon pairs (each of which will be 
randomly distributed because of the random 
nature of the correlated photon pair generation 
process) (5, 6). It is now acknowledged that the 
quantum properties of the correlated photons 
play no role in the image reconstruction process: 
Thermal light split into two beams using a beam 
splitter can be used equally effectively, albeit at 
a higher photon flux (7). Rather than using a 
beam splitter, it is possible to use a spatial light 
modulator to create a pattern where the copy is 
simply the computer memory. This approach 


Bucket detector, A, 


Computer 
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CCD, In(x,y) 


Fig. 1. Ghost imaging. Random spatial patterns, R,, illuminate an object and only the total 
transmitted (or reflected) light, A, is measured. This intensity reading is then computationally 
combined with the information of the pattern, /,(x, y) (either measured separately or known if 
generated by a computer), to form an image of the object. CCD, charge-coupled device. 
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therefore no longer requires a camera of any 
kind in the setup: The computer-generated pat- 
tern is already known and the image, J, can be 
retrieved by multiplying the single-pixel readout, 
a;, with the corresponding pattern, H,, and then 
summing over all patterns, i.e., J = So ath This 
opens the route to so-called compressed single- 
pixel imaging, in which assumptions about the 
spatial correlations of the image enable patterns 
to be judiciously chosen to require far fewer pat- 
terns than the final number of image pixels, with 
compression factors as high as 80 to 90%. This 
concept is not dissimilar from standard JPEG 
compression, which assumes that typical images 
are concentrated in their spatial frequencies, 
with the difference that now the compression 
is applied at the image acquisition stage. By this 
compression, single-pixel imaging is therefore 
transformed from a slow, relatively inefficient 
process into a highly efficient imaging technique 
that can operate at video frame rates in full color 
(8). More recent developments include exten- 
sions to lensless imaging (9) and to full 3D images 
for which depth information is obtained by also 
using time-of-flight information (J0, 17)—i.e., in 
addition to object reflectivity, the imaging system 
also estimates the light travel distance, d, from 
the temporal shift, t, of the detected signal, as 
the two are related by the speed of light, d = ct, 
in free space, where c is the speed of light. In 
general, this single-pixel technique suffers 
from having low resolution and providing poor- 
quality images even when compared with a cell- 
phone camera. This limitation may be partly 
overcome by taking inspiration from nature and 
implementing computational algorithms so that 
the system increases the density of the projected 
spatial patterns only in areas of interest, there- 
fore increasing the spatial resolution in regions 
where it is needed and leaving the surrounding 
areas relatively less defined (12). This is just one 
example of computational techniques being com- 
bined with detection technology to provide more 
efficient sensing solutions. Another example is 
the first-photon imaging approach that emerged 
from a codesign of hardware and computational 
algorithms, built around the concept of single- 
photon detection. 


First-photon imaging 

An important legacy of recent interest in the field 
of quantum information science is the develop- 
ment of a series of detector technologies for 
single photons. The workhorse for most labo- 
ratories is the single-photon avalanche diode 
(SPAD). SPADs are, in essence, semiconductor 
diodes that are reverse-biased beyond their 
breakdown threshold: A single photon (or even 
a thermally generated charge in the diode) is 
sufficient to lead to the rapid charge multiplica- 
tion process (or avalanche) that creates a spike 
in the output current. A quenching mechanism 
stops the avalanche process before the diode is 
irreversibly damaged, leading also to a dead time 
during which the diode is insensitive to incident 
photons before being reactivated. The particle- 
like nature of a photon is revealed through the 
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very short burst in the SPAD output current that 
can then be very precisely timed when a refer- 
ence signal is also provided. The ability to pre- 
cisely detect the photon arrival time can be used 
for long-distance, high-precision light detection 
and ranging (LIDAR). A distant object is illumi- 
nated with a pointlike pulsed laser beam. Each 
outgoing pulse starts a counter, which is then 
stopped at time t when a return photon is 
detected; accounting for the two directions of 
the light travel, the distance of the illuminated 
object is simply ct/2. Scanning the scene using 
this time-correlated single-photon counting 
(TCSPC) technique can therefore provide a 
full 3D image (or depth image) of the scene 
(13-15). However, TCSPC-based imaging can re- 
quire very long acquisition times, in particular 
when photons return to the detector at a low 
rate. Conventional processing techniques require 
(i) operation in the photon-starved regime (i.e., 
10% or less of the outgoing laser pulses should 
give rise to a detected return photon so that bias 
from detector dead times is negligible) and (ii) 
measurement over many illumination repetition 


background and the particlelike nature of the 
photons and their detection) built into the in- 
formation used to retrieve high-quality 3D im- 
ages. Similar extreme photon efficiency can be 
achieved with a fixed dwell time at each scene 
position (78), and principled statistical tech- 
niques for adapting the local spatial resolution 
to characteristics of the data enable background 
noise 25 times as strong as the back-reflected 
signal to be mitigated (19). Additional perform- 
ance improvements have been obtained with 
deep learning (20). Using an array of SPADs 
parallelizes the data acquisition and thus can 
increase imaging speed, though an array has 
coarser time resolution, translating to coars- 
ened longitudinal distance measurements (27). 
Methods for arrays can also be highly photon 
efficient (22). 


Non-line-of-sight imaging 

Photon counting has strongly affected the field 
of non-line-of-sight (NLOS) imaging—i.e., the 
imaging of objects that are, for example, hidden 
behind a wall, corner, or obstacle (23-32). Access 


Fig. 2. First-photon imaging. (A) Each photon detection can be mapped to a 3D position, 

which is often far from correct because half of the detections are due to background light. The 
number of illumination pulses until a photon is detected is inversely proportional to an initial 
estimate of reflectivity. (B) Exploiting piecewise smoothness yields improved reflectivity estimates. 
(C) Approximate noise censoring removes most detections due to background light. (D) The 
final estimate exploits piecewise smoothness of depth. [Adapted from figure 2 of (16)] 


periods so that 100 to 1000 photons or more are 
detected for each position. Under these condi- 
tions, a faithful measurement of the photon ar- 
rival time is obtained. This approach can easily 
lead to acquisition times of a complex scene 
that can be on the order of many seconds or even 
minutes. 

The computational imaging philosophy en- 
ables a marked reduction in the number of de- 
tected photons needed for 3D imaging (16). In 
the first-photon imaging approach, only the 
very first detected photon at each scan location 
is used, so the acquisition time is limited primar- 
ily by the speed of scanning, and any detector 
dead time coincides with the scanning (77). Using 
the number of pulses until a photon is detected 
as an indirect measurement of reflectivity, along 
with a piecewise-smooth assumption for both 
reflectivity and depth, a 3D image of a scene is 
produced after several computational steps, as 
shown in Fig. 2. This approach builds a strong 
link between the computational steps and the 
detailed mechanism of single-photon detec- 
tion, with various aspects (such as the noise 
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to very high temporal resolution imaging sys- 
tems allows reconstruction of a full 3D image 
of the hidden scene, as conceptually explained 
in Fig. 3A. A short laser pulse is sent to a scat- 
tering surface chosen so as to scatter light be- 
hind the obstacle and thus illuminate the hidden 
scene. The hidden scene will reflect a return echo 
that will once again hit the first scattering wall 
and return to the imaging system. An intuitive 
understanding of the hidden object reconstruc- 
tion is based on the fact that the locus of points 
that can give rise to a backscattered signal from 
a laser spot at a position 7; = (a, yz) and mea- 
sured at a given point 7; = (a;, y;) on the wall is 
given by |r — 1;| + |r — r;| + |z| = ct. This equa- 
tion describes an ellipsoid of points that can be 
recalculated for each detection point on the wall: 
each of these ellipsoids will overlap only at the 
points of origin of the (hidden object) scattering. 
Therefore, by summing over all ellipsoids, one 
obtains a high “intensity” (proportional to the 
overlap) in correspondence to the hidden ob- 
ject. With sufficient temporal resolution and 
additional processing to sharpen the retrieval, 
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it is possible to reconstruct full 3D shapes: for 
example, 100 ps is sufficient to resolve centimeter- 
sized features. Practically, most retrieval tech- 
niques aim at iteratively finding an estimate of 
p(x, y, 2), which represents the physical distri- 
bution of the hidden object, from the measured 
transient image intensity 


1 
1(0,y,2) = Il) 5 38(r—nl + rnd + [zl — en) 
Ea 
x p(@, y, 2)dadydz 


where 6 represents Dirac’s delta function. The 
first demonstration of this technique was ob- 
tained with the use of a streak camera (23) 
that provides very high 1- to 10-ps temporal res- 
olution but at the expense of relatively long 
acquisition times (Fig. 3, B and C). Successive 
demonstrations resorted to single-photon count- 
ing to reconstruct 3D images (24) and to perform 
tasks such as tracking of moving objects (27) and 
humans, even over very large distances [more 
than 50 m between the scattering wall and the 
imaging system (28)]. Recent improvements have 
demonstrated acquisition times on the order of 
seconds for a full 3D scene reconstruction by 
modifying the acquisition scheme. Specifically, 
photons are collected coaxially [i.e., along the 
same trajectory as the outgoing laser beam (31)], 
and, as a result, the measurement integral is 
simplified to 


I(2,y,1) = {fh a0 +7, — 1¢)p(a@, y, z)dadydz 


where the radiometric factor 1/r* is now only 
a function of t and can thus be removed from 
the integral. Overall, the result of this is that 
I(x, y, t) reduces to a convolution that substan- 
tially decreases the computational retrieval times, 
paving the way to real-time reconstruction of 
3D scenes. This is an example of how imaging 
hardware and computational techniques have 
coevolved to create a new imaging capability. 
It is worth pointing out that recent measure- 
ments have shown not only real-time capability 
but also the capacity for long-distance and full- 
daylight operation (28, 31), thus moving from 
proof-of-principle studies to first steps toward 
deployment in real-world applications in just a 
few years. An interesting challenge for this field 
of research starts from the observation that much 
of the technology involved in NLOS imaging is 
common with standard, direct line-of-sight of 
LIDAR (i.e., 3D imaging of environments using 
laser pulse time-of-flight measurements). In this 
sense, NLOS imaging has the potential to become 
a natural extension of LIDAR. In this context, 
there are clear applications for NLOS imaging, 
when combined with LIDAR, for urban safety and 
unmanned vehicles. Additionally, future NASA 
missions will employ SPAD arrays for LIDAR 
mapping of planet surfaces, and studies are cur- 
rently under way to evaluate the potential of 
NLOS imaging to remotely (e.g., from a satellite) 
assess the internal structure of underground 
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caves on planets, with a view toward future hu- 
man colonization activities (33). 


Enhanced SPAD arrays for imaging in 
scattering media 


Over the past several years, a number of indus- 
trial and academic research groups have been 
developing a new generation of cameras in which 
each individual pixel consists of a SPAD. A rela- 
tively widespread version of these sensors, of- 
ten referred to as “quanta imaging sensors,” is 
operated in binary mode—that is, the sensor pixel 
generates a “1” when the number of photons is 
larger than a certain threshold (typically set to 
just one photon) and generates a “O” otherwise 
(34-38). Each frame therefore has a single-bit 
depth. To build an image, multiple frames must 
be added together. This operation mode brings 
some advantages: Aside from the single-photon 
sensitivity, one can add as many frames as desired 
so as to achieve very high bit depths (dynamic 
ranges) that are not attainable with standard 
complementary metal-oxide semiconductor cam- 
eras. Moreover, the single-bit nature of the acqui- 
sition permits very high frame acquisition rates 
(rates up to 100 kHz have been reported) (39). 
Progress has also been made in the full on-chip 
integration of TCSPC electronics, thus providing 
the additional functionality of temporal resolu- 
tions as low as 50 ps (21, 40-43). This implies that, 
when combined with a high repetition rate laser 
for the active illumination of the scene, video 
rates reaching up to 20 billion frames per second 
can be achieved (44). This remarkable perform- 
ance can be better appreciated when expressed 
in terms of the actual imaging capability. At such 
frame rates, light propagates just 1.5 cm between 
successive frames, which implies that it is pos- 
sible to actually freeze light in motion in much 
the same way that standard high-speed cameras 
can freeze the motion ofa supersonic bullet. The 
first images of light in flight were shown in the 
late 1960s via nonlinear optical gating methods 
(45-47), but the first camera-based measure- 
ments were only recently demonstrated with the 
use of a streak camera (48). More recent mea- 
surements based on SPAD arrays have allowed 
the first capture of light pulses propagating in 
free space with total acquisitions times on the 
order of seconds or less (44). SPAD array cam- 
eras have also been used to directly image laser 
pulse propagation through optical fibers: Beyond 
their direct applications (e.g., estimation of phys- 
ical parameters of optical fibers), these measure- 
ments combined a fusion of single-photon data 
with hyperspectral imaging over several different 
wavelengths (discussed below) and computa- 
tional processing through which the 32-pixel- 
by-32-pixel resolution was successfully up-sampled 
by using the temporal axis to re-interpolate the 
pulse trajectory in the (a, y) spatial plane (49). 
The ability to capture simultaneously spatial 
and high-resolution temporal information at very 
low light levels with SPAD cameras has recently 
been applied to other difficult imaging problems, 
such as imaging and sensing through scattering 
and turbid media. For example, Pavia et al. have 
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applied inverse retrieval methods in combination 
with spatial and temporal information from a 
linear SPAD array for tomographic reconstruc- 
tion of objects hidden in murky water (50). More 
recently, Heshmat and co-workers have acquired 
data with a planar SPAD array and reconstructed 
various shapes of objects obscured by a thick tis- 
sue phantom (57). Their technique was referred 
to as “All Photons Imaging,” directly underlining 
the importance of the photon time-of-flight in- 
formation that is recorded by the single-photon 
camera. We note that such approaches do not 
explicitly account for the physical origins of the 
data. For example, temporal and spatial informa- 
tion are placed on equal footing and enter in the 
retrieval process without incorporation of statis- 
tical models for timing jitter or surface reflec- 
tivity of the objects. Future SPAD imaging will 
benefit from incorporating accurate spatiotem- 
poral statistical models for sources, photon trans- 
port medium, and photon detectors. In the broad 
regime of strong scattering, the camera will typi- 
cally record an indistinct, diffuse illumination 
transmitted through the medium or reflected 
from the scene with little or no obvious informa- 
tion about any objects hidden behind or inside the 
scattering medium. Computational techniques 
are thus required to actually retrieve details about 
the object. This field of research is of particular 
interest for a number of applications such as med- 
ical diagnostics and imaging, as well as sensing 
and imaging through fog. 

With the emergence of imaging systems for 
autonomous underwater vehicles, unmanned 
aerial vehicles, robots, or cars, rain and fog pres- 
ent important challenges that must be overcome. 
Sonar is a well-established technology for long 
range underwater imaging, but it can suffer from 
low spatial resolution limited by the physics of 
sound propagation in the medium. Although 
high-power optical solutions can be used for 
short-range imaging in relatively clear water, 
the presence of underwater scatterers between 
the active imaging system and the scene (e.g., the 
seabed) usually produce large quantities of re- 
flected photons that can mask the returns from 
the scene of interest. By using a pulsed illumi- 
nation source combined with sensitive single- 
photon detectors, it is possible to discriminate 
the photons reflected because of scattering in the 
water from those (an extremely small refraction) 
that actually reach the surfaces of interest. For 
instance, Maccarone et al. have demonstrated 
the ability to image underwater up to eight at- 
tenuation lengths (52). When combining this 
cutting-edge technology with advanced statisti- 
cal methods inspired by our previous work (53), 
substantial performance improvements could be 
achieved in terms of 3D reconstruction and esti- 
mation of the surface reflectivity by accounting 
for the distance-induced signal loss (54). Ona 
related note, efforts for 3D reconstruction of 
terrestrial scenes at long distances suffer from 
limitations similar to those described above. 
Even if the measurements are performed under 
favorable (e.g., dry) conditions, the recorded sig- 
nals can be considerably affected by atmospheric 
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Fig. 3. Non-line-of-sight imaging. (A) Basic geometry of the problem. A 
laser illuminates a scattering surface and scatters light around a wall 
that hides an object from the direct line of sight. The return signal 
backscattered from the hidden object is detected at a point “i” on the 
scattering surface. This geometry, with a single observation point, defines 


turbulence (55-57) and solar illumination (58, 59). 
Again, marked improvements in detection accu- 
racy (60) and maximal observable range (67) have 
been obtained via the use of adapted computa- 
tional tools. The problem becomes even more 
acute in the presence of fog, which is a major con- 
cern for the next generation of automated cars. 
It has been demonstrated that it is technically 
possible to detect and analyze fog patches over 
long distances, provided that the laser power is 
sufficient to ensure a nonzero probability of pho- 
ton reflection and a long enough acquisition 
time (62, 63). In the automotive context, where 
the acquisition time is intrinsically limited by the 
vehicle displacement velocity, more robust and 
computationally efficient strategies have been 
recently proposed (51, 64), and it is clear that 
future imaging systems will incorporate com- 
putational models for both the propagation phys- 
ics of the medium and physical properties of the 
detector. 


Multispectral single-photon imaging 


Multispectral and hyperspectral imaging, which 
are extensions of classical color (RGB) imaging, 
consist of imaging a scene using multiple wave- 
lengths (from four to several hundreds or even 
thousands in hyperspectral images). These mo- 
dalities have benefited from a robust body of 
research spanning more than 35 years, from the 
data collection community (65-67), and, more 
importantly, from the data processing and anal- 
ysis community (68-73). Indeed, such modalities 
can be associated with a wide variety of computa- 
tional problems, ranging from image acquisition 
(compressive sampling), restoration (denoising/ 
deblurring, superresolution), segmentation (clas- 
sification) to source separation (spectral unmix- 
ing), object/anomaly detection, and data fusion 
(e.g., pansharpening). Though the main applica- 
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tions using (passive) multispectral imaging are 
in Earth and space observation, the proven ben- 
efits of imaging with multiple wavelengths simul- 
taneously have enabled its application in the food 
industry (66, 74) and a broader range of appli- 
cations such as diagnostic dermatology (75, 76). 
Active multispectral imaging is less sensitive to 
ambient illumination than passive imaging, which 
requires data acquisition under daylight condition 
(e.g., for Earth observation). Without fast timing 
capabilities, however, multi- and hyperspectral 
imagers are only able to provide 2D intensity 
profiles and are thus poorly adapted to analysis 
of multilayered 3D structures such as forest 
canopies. Multispectral LIDAR is a promising 
modality that allows for joint extraction of geo- 
metric (as single-wavelength LIDAR) and spectral 
(as passive multispectral images) information 
from the scene while avoiding data registration 
issues potentially induced by the fusion of het- 
erogeneous sensors. Wallace et al. have demon- 
strated that it is possible to use multispectral 
single-photon LIDAR (MSPL) to remotely infer 
the spatial composition (leaves and branches) 
and the health of trees using only four wave- 
lengths (77). More recently, new experiments have 
been designed to image up to 33 wavelengths 
(500 to 820 nm) in free space (78) and 16 wave- 
lengths underwater (79). As a consequence, we 
have witnessed the development of algorithms 
inspired from passive hyperspectral imagery 
(3D datacubes) for analysis of MSPL data (4D 
datacubes). 

For instance, Bayesian methods have been 
proposed to cluster, in an unsupervised manner, 
spectrally similar objects while estimating their 
range from photon-starved MSPL data (80). This 
work was further developed (8/7, 82) to classify 
pixels on the basis of their spectral profiles in 
photon-starved regimes down to one photon per 
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an ellipsoid of possible locations for the object. Detection of the time- 
resolved transient images at multiple points on the surface allows 
reconstruction of the location or even the full 3D shape of the object. 
(B) An example of a hidden object with its reconstruction shown in (C). 
[Panels (B) and (C) adapted from (21)] 


pixel and per spectral band, on average (Fig. 4). 
Such results are possible only by efficiently com- 
bining a highly sensitive raster-scanning single- 
photon system that allows for submillimeter range 
resolution with hierarchical Bayesian models able 
to capture the intrinsic, yet faint, structures (e.g., 
spatial and spectral correlations) of the data. A 
notable improvement has been demonstrated by 
using simulation methods (see next section) to 
reconstruct scenes (range and reflectivity pro- 
files) with as few as four photons per pixel (with 
four spectral bands and one photon per pixel, on 
average) (81). 

Spectral unmixing presents another challeng- 
ing problem in the use of multi- and hyperspectral 
data for identification and quantification of ma- 
terials or components present in the observed 
scene. Spectral unmixing can lead to improved 
classification by accounting for the fact that sev- 
eral mixed materials can be observed in a given 
pixel. Spectral unmixing methods allow for sub- 
pixel material quantification, which is particularly 
important for long-range imaging scenarios in 
which the divergence of the laser beam cannot be 
neglected. We developed a computational method 
for quantifying and locating 15 known materials 
from MSPL data consisting of 33 spectral bands 
while detecting additional (potentially unknown) 
materials present in the scene (78). Again, this 
work demonstrated the possibility of material 
quantification and anomaly detection with as 
little as 1 photon per pixel and per spectral band, 
on average. It also illustrated how Bayesian mod- 
eling can be used for uncertainty quantification— 
e.g., for providing confidence intervals associated 
with estimated range profiles. As mentioned 
above, although the most recent single-photon 
detectors are very attractive because of their high 
temporal resolution, their application to informa- 
tion extraction from wide-area scenes is hindered 
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by long acquisition times associated with raster 
scanning strategies. This is particularly limit- 
ing when several wavelengths are acquired in 
a sequential manner. To address this problem, 
compressive sampling strategies have been in- 
vestigated to achieve faster MSPL data acquisi- 
tion (83, 84). Although computational methods 
for image scanning systems have been proposed, 
whereby a random number of spectral bands can 
be probed in a given pixel, the most promising 
results have been obtained with a simulated mo- 
saic filter (four wavelengths) whose implemen- 
tation within a SPAD array could allow for the 
simultaneous acquisition of multiple pixels and 
fast reconstruction of range and reflectivity pro- 
files. These results show how advanced com- 
putational methods can be used to enhance 
information extraction from imaging systems 
and also improve the design of future detectors 
and detector arrays. 


Computational methods in the 
photon-starved regime 


From a mathematical perspective, computational 
imaging is formulated as finding a mapping that 
reconstructs a set of parameters a, which may 
have a physical meaning (or not), from a set of 
measurements y recorded by an imaging or 
sensing device. These parameters can take con- 
tinuous values (e.g., light field intensities, object 
positions, and velocities) or discrete values (e.g., 
the number objects, binary values representative 
of the presence or absence of objects). Two main 
families of methods can be adopted to design al- 
gorithms for data analysis—namely, supervised 
machine learning approaches and statistical 
signal processing approaches—although hybrid 
methods can also be used. The choice of the most 
suitable approaches depends primarily on the 
complexity of the computational model, as well 
as the computational budget available (i.e., the 
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expected processing time, data storage limita- 
tions, and the desired quality of the information 
extracted). Supervised machine learning (includ- 
ing deep learning) approaches are particularly 
well suited for applications where a sufficient 
quantity of ground truth data or reference data 
is available (85-88). Such methods rely on a two- 
stage process, which consists of the training 
stage and the test stage. Starting from a forward 
model y = g(2), relating the measured data y to 
the unknown source parameters 2, the training 
stage uses a set of measurements and corre- 
sponding parameters to learn the inverse map- 
ping h(-) between the measurements and the 
set of parameters to be recovered—i.e., it fits an 
inverse model a ~ h(y). In contrast to model- 
based statistical methods, data-driven machine 
learning approaches do not rely on the knowl- 
edge a forward model g(-). Thus, these methods 
can often be applied to complex problems where 
the forward model is unknown or too compli- 
cated to be derived analytically but for which 
plenty of training data are available. The training 
stage controls the quality of the estimation of the 
mapping /A(-), which in turn depends on the 
representational power of the machine learning 
algorithm and on the quality and diversity of the 
training samples. Machine learning approaches 
have been successfully applied to imaging ap- 
plications such as imaging through multimode 
fibers (85), lensless imaging of phase objects (86), 
and identification of a human pose from behind 
a diffusive screen (87). SPAD cameras have been 
specifically applied to identifying both the posi- 
tions and identities of people hidden behind a 
wall (88), imaging through partially transmissive 
(89) and scattering media (57), and profiling 
camouflaged targets (90). The design of reliable 
machine learning approaches is currently limited 
by high generalization error (i.e., machines must 
be retrained for different acquisition scenarios) 


02 0 O02 O04 


Fig. 4. Computational inverse probability methods to spectrally classify and depth-resolve 
objects in a scene from photon-starved multispectral LIDAR images. The scene (A) was 


composed of 14 clay materials of different colors. The recorded images consist of a 200-pixel—by— 
200-pixel area (scanned target areas were approximately 50 mm by 50 mm), and the targets were 
placed 1.85 m from the system. In (B), the first column depicts the estimated depth profile (in 
millimeters), the reference range being arbitrarily set to the backplane range. The second column 
shows color classification maps, and the third column depicts the spectral signatures of the most 
prominent classes, projected onto the first and second axes obtained by using principal components 
analysis. Each of these subplots illustrates the similarity between the estimated spectral signatures. 
Rows (i) and (ii) in (B) depict results obtained with an average of one detected photon per pixel, for 
each spectral band, with 33 and 4 bands, respectively. [Adapted with permission from (73)] 
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and a lack of ground truth information about 
the sources or the medium. 

Statistical model-based methods can thus be 
more attractive than data-driven machine learn- 
ing approaches for photon-limited imaging, as a 
mathematical forward model g(-) can be com- 
bined with a statistical noise model to better fit 
the data. Physical considerations, such as light 
transport theory through the medium and the 
detector, can often guide model choice, although 
non-physically inspired approximations can be 
used to make the model fitting algorithm more 
computationally tractable. When there is mea- 
surement uncertainty and noise, the forward 
model can be better characterized by the con- 
ditional probability distribution f(y|a), which 
describes the statistical variation of the measure- 
ments y for a given source parameter value x. 
For fixed value y, the function l,(a#) = f(y|x), 
called the likelihood function, quantifies the 
likelihood that the source value a generated 
the observed value y. The maximum likelihood 
principle forms an estimate of # from y by max- 
imizing the likelihood over x. However, the max- 
imum likelihood estimate is often not unique 
in high-dimensional inverse problems such as 
imaging. Fortunately, additional information 
about 2 (e.g., a priori knowledge about positivity, 
smoothness, or sparsity) is often available and 
can be used to improve on the maximum like- 
lihood estimate. For instance, suppose 4 is a reg- 
ularization function such that (a) is small 
when x complies with a priori knowledge and 
is large otherwise. Then it is possible to re- 
cover x by minimizing the cost function C, (a) = 
—log (i, (a) )+ (a).If (a) can be associated 
with a proper density f(a’) < e~®), called the 
prior distribution, this penalized likelihood 
estimation strategy can be interpreted in the 
Bayesian formalism as a maximum a posteriori 
estimation procedure. In other words, the above 
minimization is equivalent to maximizing the 
posterior density of a 


Sf (aly) =f(yle)F(@)/F(y) 


where f(y) is a density that does not depend on 
wx. These and related likelihood-based approaches 
have been adopted by many researchers studying 
low photon imaging (J7, 18, 83). 

The Bayesian formalism allows for the ob- 
served data to be combined with additional in- 
formation in a principled manner. This also 
allows so-called a posteriori measures of uncer- 
tainty to be derived. However, such measures 
cannot be computed analytically in most practi- 
cal applications because of the complexity of accu- 
rate spatial correlation models, and they often 
must be approximated using high-dimensional 
integration. A considerable advantage may be 
gained from computationally simple pixel-by- 
pixel adaptation (91), and a classical approach 
thus consists of approximating these measures 
(e.g., a posteriori variances/covariances or confi- 
dence intervals) using variational approximations 
or simulation techniques. Markov chain Monte 
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Carlo methods are particularly well adapted for 
inference in difficult scenarios for which the cost 
function or posterior distribution of interest has 
multiple modes and multiple solutions poten- 
tially admissible. For instance, such methods 
have been successfully applied to object detec- 
tion (60), and joint material identification 
and anomaly detection (78) from low-flux single- 
photon LIDAR measurements. 


Conclusions 


Considering the rapid advance in imaging cam- 
eras and sensors together with a leap forward in 
computational capacity, we see enormous poten- 
tial for innovation over the next several years. 
The main challenge—or the main opportunity— 
at this stage is the codevelopment of sensors 
and computational algorithms built around the 
physical processes of the photon transport and 
detection mechanisms. We have provided exam- 
ples showing progress in this direction, ranging 
from first-photon imaging techniques to photon- 
starved hyperspectral imaging. The trend seen 
in commercial cameras between 2000 and 2015, 
characterized by a constant drive toward higher 
pixel counts, has slowly subsided, giving way to a 
different approach whereby both performance 
and functionality are increased by combining 
multiple sensors through computational process- 
ing. Obvious examples are recent advances in 
cell-phone technology, arguably one of the main 
drivers behind imaging technology, that now 
boasts multiple cameras and lenses providing 
depth perception, improved signal-to-noise ratio, 
and other functionalities such as 3D face recog- 
nition. With SPAD cameras also gradually making 
their appearance on the commercial scene, single- 
photon imaging and computational techniques 
offer a promising avenue for future innovation in 
situations where previously imaging was thought 
to be impossible. We have briefly discussed ex- 
amples such as imaging though denser scattering 
media (the human body or fog) and full 3D imag- 
ing of scenes around a corner or beyond the di- 
rect line of sight. These examples are particularly 
relevant in demonstrating the progress that can 
be made when the photon transport models and 
computational approaches are integrated with 
the new generation of photon detectors. The 
expectation is that over the next several years 
we will witness substantial growth of computa- 
tional imaging methods, driven by and also 
driving new technologies such as single-photon 
SPAD arrays that will revolutionize nearly every 
aspect of human activity, ranging from med- 
ical diagnostics to urban safety and space 
missions. 
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INTRODUCTION: Wheat (Triticum aestivum L.) 
is the most widely cultivated crop on Earth, 
contributing about a fifth of the total calories 
consumed by humans. Consequently, wheat 
yields and production affect the global econ- 
omy, and failed harvests can lead to social 
unrest. Breeders continuously strive to develop 
improved varieties by fine-tuning genetically 
complex yield and end-use quality parameters 
while maintaining stable yields and adapt- 
ing the crop to regionally specific biotic and 
abiotic stresses. 


RATIONALE: Breeding efforts are limited by 
insufficient knowledge and understanding of 


ANS. 


wheat biology and the molecular basis of cen- 
tral agronomic traits. To meet the demands of 
human population growth, there is an urgent 
need for wheat research and breeding to ac- 
celerate genetic gain as well as to increase and 
protect wheat yield and quality traits. In other 
plant and animal species, access to a fully an- 
notated and ordered genome sequence, includ- 
ing regulatory sequences and genome-diversity 
information, has promoted the development of 
systematic and more time-efficient approaches 
for the selection and understanding of im- 
portant traits. Wheat has lagged behind, pri- 
marily owing to the challenges of assembling 
a genome that is more than five times as large 


Wheat genome deciphered, assembled, and ordered. Seeds, or grains, are what counts with 
respect to wheat yields (left panel), but all parts of the plant contribute to crop performance. With 
complete access to the ordered sequence of all 21 wheat chromosomes, the context of regulatory 
sequences, and the interaction network of expressed genes—all shown here as a circular plot (right 
panel) with concentric tracks for diverse aspects of wheat genome composition—breeders and 
researchers now have the ability to rewrite the story of wheat crop improvement. Details on value 
ranges underlying the concentric heatmaps of the right panel are provided in the full article online. 
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as the human genome, polyploid, and complex, 
containing more than 85% repetitive DNA. To 
provide a foundation for improvement through 
molecular breeding, in 2005, the International 
Wheat Genome Sequencing Consortium set out 
to deliver a high-quality annotated reference 
genome sequence of bread wheat. 


RESULTS: An annotated reference sequence 
representing the hexaploid bread wheat ge- 
nome in the form of 21 chromosome-like se- 
quence assemblies has now been delivered, 
giving access to 107,891 high-confidence genes, 
including their genomic context of regulatory 
sequences. This assembly enabled the discovery 
of tissue- and developmental stage-related gene 
coexpression networks using a transcriptome 
atlas representing all stages of wheat develop- 
ment. The dynamics of change in complex gene 


families involved in environ- 
mental adaptation and end- 
Read the full article  USe quality were revealed at 
at http://dx.doi. subgenome resolution and 


org/10.1126/ 
science.aar7191 


contextualized to known 
agronomic single-gene or 
quantitative trait loci. As- 
pects of the future value of the annotated as- 
sembly for molecular breeding and research 
were exemplarily illustrated by resolving the 
genetic basis of a quantitative trait locus con- 
ferring resistance to abiotic stress and insect 
damage as well as by serving as the basis for 
genome editing of the flowering-time trait. 


CONCLUSION: This annotated reference se- 
quence of wheat is a resource that can now 
drive disruptive innovation in wheat improve- 
ment, as this community resource establishes 
the foundation for accelerating wheat research 
and application through improved understanding 
of wheat biology and genomics-assisted breeding. 
Importantly, the bioinformatics capacity devel- 
oped for model-organism genomes will facilitate 
a better understanding of the wheat genome as 
a result of the high-quality chromosome-based 
genome assembly. By necessity, breeders work 
with the genome at the whole chromosome level, 
as each new cross involves the modification of 
genome-wide gene networks that control the ex- 
pression of complex traits such as yield. With 
the annotated and ordered reference genome 
sequence in place, researchers and breeders can 
now easily access sequence-level information to 
precisely define the necessary changes in the 
genomes for breeding programs. This will be 
realized through the implementation of new DNA 
marker platforms and targeted breeding tech- 
nologies, including genome editing. & 


The list of author affiliations is available in the full article online. 
*Corresponding authors: Rudi Appels (rudi.appels@unimelb. 
edu.au); Kellye Eversole (eversole@eversoleassociates.com); 
Nils Stein (stein@ipk-gatersleben.de) 

Cite this article as International Wheat Genome Sequencing 
Consortium, Science 361, eaar7191 (2018). DOI: 10.1126/ 
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An annotated reference sequence representing the hexaploid bread wheat genome in 

21 pseudomolecules has been analyzed to identify the distribution and genomic context 
of coding and noncoding elements across the A, B, and D subgenomes. With an estimated 
coverage of 94% of the genome and containing 107,891 high-confidence gene models, 
this assembly enabled the discovery of tissue- and developmental stage-related 
coexpression networks by providing a transcriptome atlas representing major stages of wheat 
development. Dynamics of complex gene families involved in environmental adaptation 
and end-use quality were revealed at subgenome resolution and contextualized to known 
agronomic single-gene or quantitative trait loci. This community resource establishes 

the foundation for accelerating wheat research and application through improved 
understanding of wheat biology and genomics-assisted breeding. 


heat (Triticum aestivum L.), the most 

widely cultivated crop on Earth, con- 

tributes about a fifth of the total calories 

consumed by humans and provides more 

protein than any other food source (J, 2). 
Breeders strive to develop improved varieties by 
fine-tuning genetically complex yield and end- 
use quality parameters while maintaining yield 
stability and regional adaptation to specific 
biotic and abiotic stresses (3). These efforts are 
limited, however, by insufficient knowledge and 
understanding of the molecular basis of key 
agronomic traits. To meet the demands of 
human population growth, there is an urgent 
need for wheat research and breeding to ac- 
celerate genetic gain while increasing wheat 
yield and protecting quality traits. In other plant 
and animal species, access to a fully annotated 
and ordered genome sequence, including regu- 
latory sequences and genome-diversity informa- 
tion, has promoted the development of systematic 
and more time-efficient approaches for the selec- 
tion and understanding of important traits (4). 
Wheat has lagged behind other species, primarily 
owing to the challenges of assembling a large 
(haploid genome, 1C = 16 Gb) (5), hexaploid, and 
complex genome that contains more than 85% 
repetitive DNA. 

To provide a foundation for improvement 
through molecular breeding, the International 
Wheat Genome Sequencing Consortium (IWGSC) 
established a road map to deliver a high-quality 
reference genome sequence of the bread wheat 
cultivar Chinese Spring (CS). A chromosome 
survey sequence (CSS) intermediate product as- 
signed 124,201 gene loci across the 21 chromo- 
somes and revealed the evolutionary dynamics of 


*All authors with their affiliations are listed at the end of this paper. 
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the wheat genome through gene loss, gain, and 
duplication (6). The lack of global sequence con- 
tiguity and incomplete coverage (only 10 Gb were 
assembled), however, did not provide the wider 
regulatory genomic context of genes. Subsequent 
whole-genome assemblies improved contiguity 
(7-9) but lacked full annotation and did not re- 
solve the intergenic space or present the genome 
in the correct physical order. 

Here we report an ordered and annotated as- 
sembly (IWGSC RefSeq v1.0) of the 21 chromo- 
somes of the allohexaploid wheat cultivar CS, 
an achievement that is built on a rich history of 
chromosome studies in wheat (J0-12), which 
allowed the integration of genetic and genomic 
resources. The completeness and accuracy of 
IWGSC RefSeq v1.0 provide insights into global 
genome composition and enable the construc- 
tion of complex gene coexpression networks to 
identify central regulators in critical pathways, 
such as flowering-time control. The ability to 
resolve the inherent complexity of gene families 
related to important agronomic traits demon- 
strates the impact of IWGSC RefSeq v1.0 on 
dissecting quantitative traits genetically and 
implementing modern breeding strategies for 
future wheat improvement. 


Chromosome-scale assembly 
of the wheat genome 


Pseudomolecule sequences representing the 21 
chromosomes of the bread wheat genome were 
assembled by integrating a draft de novo whole- 
genome assembly (WGA), built from Illumina short- 
read sequences using NRGene deNovoMagic2 
(Fig. 1A, Table 1, and tables S1 and S2), with addi- 
tional layers of genetic, physical, and sequence 
data (tables S3 to S8 and figs. S1 and $2). In the 
resulting 14.5-Gb genome assembly, contigs and 
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scaffolds with N50s of 52 kb and 7 Mb, respec- 
tively, were linked into superscaffolds (N50 = 
22.8 Mb), with 97% (14.1 Gb) of the sequences 
assigned and ordered along the 21 chromosomes 
and almost all of the assigned sequence scaffolds 
oriented relative to each other (13.8 Gb, 98%). 
Unanchored scaffolds comprising 481 Mb (2.8% 
of the assembly length) formed the “unassigned 
chromosome” (ChrUn) bin. The quality and conti- 
guity of the IWGSC RefSeq v1.0 genome assembly 
were assessed through alignments with radia- 
tion hybrid maps for the A, B, and D subgenomes 
[average Spearman’s correlation coefficient (7) 
of 0.98], the genetic positions of 7832 and 4745 
genotyping-by-sequencing derived genetic mark- 
ers in 88 double haploid and 993 recombinant 
inbred lines (Spearman’s r of 0.986 and 0.987, re- 
spectively), and 1.24 million pairs of neighbor 
insertion site-based polymorphism (ISBP) mark- 
ers (13), of which 97% were collinear and mapped 
in a similar size range (difference of <2 kb) be- 
tween the de novo WGA and the available bacte- 
rial artificial chromosome (BAC)-based sequence 
assemblies. Finally, IWGSC RefSeq v1.0 was as- 
sessed with independent data derived from coding 
and noncoding sequences, revealing that 99 and 
98% of the previously known coding exons (6) and 
transposable element (TE)-derived (ISBP) markers 
(table S9), respectively, were present in the assem- 
bly. The approximate 1-Gb size difference between 
IWGSC RefSeq v1.0 and the new genome size 
estimates of 15.4 to 15.8 Gb (/4) can be accounted 
for by collapsed or unassembled sequences of 
highly repeated clusters, such as ribosomal RNA 
coding regions and telomeric sequences. 

A key feature distinguishing the IWGSC RefSeq 
v1.0 from previous draft wheat assemblies (6-9) 
is the long-range organization, with 90% of the 
genome represented in superscaffolds larger than 
4.1 Mb and with each chromosome represented, 
on average, by only 76 superscaffolds (Table 1). 
The largest superscaffold spans 166 Mb, which is 
half the size of the rice (Oryza sativa L.) genome 
and is larger than the Arabidopsis thaliana L. 
genome (15, 16). Moreover, the 21 pseudomolecules 
position molecular markers for wheat research 
and breeding [504 single-stranded repeats (SSRs), 
3025 diversity array technologies (DArTs), 6689 
expressed sequence tags (ESTs), 205,807 single- 
nucleotide polymorphisms (SNPs), and 4,512,979 
ISBPs] (table S9), thus providing a direct link be- 
tween the genome sequence and genetic loci and 
genes underlying traits of agronomic importance. 


The composition of the wheat genome 


Analyses of the components of the genome se- 
quence revealed the distribution of key elements 
and enabled detailed comparisons of the home- 
ologous A, B, and D subgenomes. Accounting for 
85% of the genome, with a relatively equal dis- 
tribution across the three subgenomes (Table 2), 
3,968,974 copies of TEs belonging to 505 families 
were annotated. Many (112,744) full-length long 
terminal repeat (LTR)-retrotransposons were 
identified that have been difficult to define from 
short-read sequence assemblies (fig. S3). Although 
the TE content has been extensively rearranged 
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Fig. 1. Structural, functional, and conserved synteny landscape of 

the 21 wheat chromosomes. (A) Circular diagram showing genomic 
features of wheat. The tracks toward the center of the circle display (a) 
chromosome name and size (100-Mb tick size; light gray bar indicates the 
short arm and dark gray indicates the long arm of the chromosome); 

(b) dimension of chromosomal segments R1, R2a, C, R2b, and R3 [(18) and 
table S29]; (c) K-mer 20-frequencies distribution; (d) LTR-retrotransposons 
density; (e) pseudogenes density (O to 130 genes per Mb); (f) density of 
HC gene models (0 to 32 genes per Mb); (g) density of recombination rate; 
and (h) SNP density. Connecting lines in the center of the diagram 
highlight homeologous relationships of chromosomes (blue lines) and 
translocated regions (green lines). (B) Distribution of Pfam domain 
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PFO08284 “retroviral aspartyl protease” signatures across the different 
wheat chromosomes. (C) Positioning of the centromere in the 2D 
pseudomolecule. Top panel shows density of CENH3 ChIP-seq data along 
the wheat chromosome. Bottom panel shows distribution and proportion 

of the total pseudomolecule sequence composed of TEs of the Cereba and 
Quinta families. The bar below the bottom panel indicates pseudomolecule 
scaffolds assigned to the short (black) or long (blue) arm on the basis 

of CSS data (6) mapping. (D) Dot-plot visualization of collinearity between 
homeologous chromosomes 3A and 3B in relation to distribution of 

gene density and recombination frequency (left and bottom panel boxes: 
blue and purple lines, respectively). Chromosomal zones RI1, R2a, C, 

R2b, and R3 are colored as in (A). cM, centimorgan. 


International Wheat Genome Sequencing Consortium (IWGSC), Science 361, eaar7191 (2018) 17 August 2018 2 of 13 


810z ‘9} IsnBny uo /610'beweoualos’e0ua!0S//:d}]y Wo.) pepeo|juMOGg 


RESEARCH | RESEARCH ARTICLE 


through rounds of deletions and amplifications 
since the divergence of the A, B, and D subge- 
nomes about 5 million years ago, the TE families 
that shaped the Triticeae genomes have been 
maintained in similar proportions: 76% of the 
165 TE families present in a cumulative length 
greater than 1 Mb contributed similar propor- 
tions (less than a twofold difference between sub- 
genomes), and only 11 families, accounting for 2% 
of total TEs, showed a higher than threefold dif- 
ference between two subgenomes (17). TE abun- 
dance accounts, in part, for the size differences 
between subgenomes—for example, 64% of the 
1.2-Gb size difference between the B and D sub- 
genomes can be attributed to lower gypsy retro- 
transposon content. Low-copy DNA content 
(primarily unclassified sequences) also varied be- 
tween subgenomes, accounting, for example, for 
97 Mb of the 245-Mb size difference between A 
and B subgenomes (fig. S4). As reported (78), no 
evidence was found for a major burst of transpo- 
sition after polyploidization. The independent 
evolution in the diploid lineages was reflected in 
differences in the specific composition of the A, 
B, and D subgenomes at the subfamily (variants) 
level, as evidenced by subgenome-specific over- 


representation of individual transposon domain 
signatures (Fig. 1B). See (17) for a more detailed 
analysis of the TE content and its impact on the 
evolution of the wheat genome. 

In addition to TEs, annotation of the inter- 
genic space included noncoding RNAs. We iden- 
tified eight new microRNA families (fig. S5 and 
table S10) and the entire complement of tRNAs 
(which showed an excess of lysine tRNAs, fig. S6). 
Around 8000 nuclear-inserted plastid DNA seg- 
ments and 11,000 nuclear-inserted mitochondrial 
DNA segments representing, respectively, 5 and 
17 Mb were also revealed by comparing the ge- 
nome assembly with complete plastid and mito- 
chondrial genomes assembled from the IWGSC 
RefSeq v1.0 raw read data (74). 

Precise positions for the centromeres were 
defined by integrating Hi-C, CSS (6), and pub- 
lished chromatin immunoprecipitation sequencing 
(ChIP-seq) data for CENH3, a centromere-specific 
histone H3 variant (19). Clear ChIP-seq peaks 
were evident in all chromosomes and coincided 
with the centromere-specific repeat families (Fig. 
1C, fig. $7, and table S11). CENH3 targets were 
also found in unassigned sequence scaffolds 
(ChrUn), indicating that centromeres of several 


_—— ey) 
Table 1. Assembly statistics of IWGSC RefSeq v1.0. 


Assembly characteristics 


Assembly size 


Size of assembly in scaffolds = 100 kb 


Contig L50 number 
Contig L90 number 
Ns in contigs 
Scaffold L50 number 
Largest scaffold 


Gaps filled with BAC sequences 


N50 superscaffold length 


N90 superscaffold length 


Average number of superscaffolds per chromosome 
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81,427 


294,934 


571 


45.8 Mb 


183 (17 Mb) 


22.8 Mb 


14.1 Gb (96.8%) 


1,601 


13.8 Gb (95%) 
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chromosomes are not yet completely resolved. On 
the basis of these data, a conservative estimate for 
the minimal average size of a wheat centromere is 
4.9 Mb (6.7 Mb, if including ChrUn; table S11), com- 
pared with an average centromere size of ~1.8 Mb 
in maize (20, 27) and 0.4 to 0.8 Mb in rice (22). 

Gene models were predicted with two inde- 
pendent pipelines previously utilized for wheat 
genome annotation and then consolidated to pro- 
duce the RefSeq Annotation v1.0 (fig. S8). Subse- 
quently, a set of manually curated gene models 
was integrated to build RefSeq Annotation v1.1 
(fig. S9 and tables S12 to S17). In total, 107,891 
high-confidence (HC) protein-coding loci were 
identified, with relatively equal distribution across 
the A, B, and D subgenomes (35,345, 35,643, and 
34,212, respectively; Figs. 1D and 2A, fig. S10, and 
table S18). In addition, 161,537 other protein- 
coding loci were classified as low-confidence (LC) 
genes, representing partially supported gene 
models, gene fragments, and orphans (table S18). 
A predicted function was assigned to 82.1% 
(90,919) of HC genes in RefSeq Annotation v1.0 
(tables S19 and S20), and evidence for transcrip- 
tion was found for 85% (94,114) of the HC genes 
versus 49% of the LC genes (23). Within the 
pseudogene category, 25,419 (8%) of 303,818 
candidates matched LC gene models. The D sub- 
genome contained significantly fewer pseudo- 
genes than the A and B subgenomes (81,905 versus 
99,754 and 109,097, respectively; y” test, P < 2.2 x 
10-1) (tables $21 and $22 and fig. S10). In ChrUn, 
2691 HC and 675 LC gene models were identified. 

The quality of the RefSeq Annotation v1.1 gene 
set was benchmarked against BUSCO v3 (24), 
representing 1440 Embryophyta near-universal 
single-copy orthologs and published annotated 
wheat gene sets (Fig. 2B and fig. S11). Of the 
BUSCO v3 genes, 99% (1436) were represented 
in at least one complete copy in RefSeq Annota- 
tion v1.1 and 90% (1292) in three complete copies, 
an improvement over the 25% (353) and 70% 
(1014) of BUSCO v3 genes that were identified 
in the IWGSC (6) and TGACvI1 (8) gene sets, 
respectively (Fig. 2B). Improved contiguity of 
sequences in the immediate vicinity of genes 
was also found: 61% of the HC and LC genes 
were flanked by at least 10 kb of sequence 
without ambigous bases (Ns), in contrast to 
37% and only 5% of the HC and LC genes in the 
TGACv1 and IWGSC CSS gene models, respec- 
tively (fig. S12). 

To further characterize the gene space, a 
phylogenomic approach was applied to identify 
gene homeologs and paralogs between and with- 
in the wheat subgenomes and orthologs in other 
plant genomes (table $23 and figs. S13 to S15). 
Analysis of a subset of 181,036 genes [“filtered 
gene set,” (J4) and Table 3] comprising 103,757 
HC and 77,279 LC genes identified 39,238 
homeologous groups—that is, clades of A, B, and 
D subgenome orthologs deduced from gene trees— 
containing a total of 113,653 genes (63% of the 
filtered set). Gene losses or retention and gene 
gains (gene duplications) were determined for 
all homeologous loci of IWGSC RefSeq v1.0 (Table 
3), assuming the presence of a single gene copy at 
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every homeologous locus (referred to as a “triad”). 
The percentage of genes in homeologous groups 
for all configurations (ratios) is highly similar, 
hence balanced, across the three subgenomes: 
63% (A), 61% (B), and 66% (D). The slightly higher 
percentage of homeologs in the D subgenome, 
together with the lower number of pseudogenes 
(table S22), is consistent with its more recent 
hybridization with the AABB tetraploid genome 
progenitor. Although most of the genes are present 
in homeologous groups, only 18,595 (47%) of the 
groups contained triads with a single gene copy 
per subgenome (an A:B:D configuration of 1:1:1). 
Of the groups of homeologous genes, 5673 (15%) 
exhibited at least one subgenome inparalog, that 
is, a gene copy resulting from a tandem or a seg- 
mental trans duplication (1:1:N A:B:D configura- 
tion; N indicates a minimum of one additional 
paralog per respective subgenome). The three 
genomes exhibited similar levels of loss of indi- 
vidual homeologs, affecting 10.7% (0:1:1), 10.3% 
(1:0:1), and 9.5% (1:1:0) of the homeologous groups 
in the A, B, and D subgenomes, respectively (Table 
3 and tables S24 and S25). 

Of the 67,383 (37%) genes of the filtered set not 
present in homeologous groups, 31,140 genes 
also had no orthologs in species included in the 
comparisons outside of bread wheat and mainly 
comprised gene fragments, non-protein-coding 
loci with open reading frames, or other gene- 
calling artifacts. The remaining 36,243 genes 
had homologs outside of bread wheat and ap- 
peared to be subgenome specific (Table 3). Two 
of the genes in this category were granule bound 
starch synthase (GBSS) on chromosome 4A (1:0:0, 
a gene that is a key determinant of udon noodle 
quality) and Z/P4 within the pairing homeologous 
I (PAI) locus on chromosome 5B [0:1:0, a locus 
critical for the diploid meiotic behavior of the 
wheat homeologous chromosomes (25)]. The 
phylogenomic analysis indicated that the GBSS 
on 4A is a divergent translocated homeolog 
originally located on chromosome “7B (fig. S16), 
whereas Z/P4 is a transduplication of a chro- 
mosome 3B locus (table S26). Both genes confer 
important properties on wheat and illustrate the 
diversity in origin and function of gene models 
that are not in a 1:1:1 configuration. No evidence 
was found for biased partitioning. Rather, our 
analyses support gradual gene loss and gene 
movement among the subgenomes that may 
have occurred in either the diploid progenitor 
species or the tetraploid ancestor or following 
the final hexaploidization event in modern bread 
wheat (Table 3 and figs. S24 and S25). Together 
with the equal contribution of the three homeol- 
ogous genomes to the overall gene expression 
(23), this demonstrates the absence of subge- 
nome dominance (26). 

Of the bread wheat HC genes, 29,737 (27%) are 
present as tandem duplicates, which is up to 10% 
higher than that found for other monocotyle- 
donous species (table S27). Tandemly repeated 
genes are most prevalent in the B subgenome 
(29%), contributing to its higher gene content 
and larger number of 1:N:1 homeologous groups 
(Table 3). The postulated hybrid origin of the D 


International Wheat Genome Sequencing Consortium (IWGSC), Science 361, eaar7191 (2018) 


subgenome, as a result of interspecific crossing 
with AABB tetraploid genome progenitors 1 to 
2 million years after they diverged (27), is con- 
sistent with the synonymous substitution rates 
of homeologous gene pairs (fig. S17). Homeol- 
ogous groups with gene duplicates in at least 
one subgenome (1:1:N, 1:N:1, or N:1:1) showed 
elevated evolutionary rates (for the subgenome 


carrying the duplicate) as compared with strict 
1:1:1 or 1:1 groups (figs. $18 to S22). Homeologs 
with recent duplicates also showed higher levels 
of expression divergence (fig. S23), consistent 
with gene and genome duplications acting as a 
driver of functional innovation (28, 29). 

Analysis of synteny between the seven triplets 
of homeologous chromosomes showed high levels 
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Fig. 2. Evaluation of automated gene annotation. (A) Selected gene prediction statistics of IWGSC 
RefSeq Annotation v1.1, including number and subgenome distribution of HC and LC genes as well 

as pseudogenes. (B) BUSCO v3 gene model evaluation comparing IWGSC RefSeq Annotation v1.1 to 
earlier published bread wheat whole-genome annotations, as well as to annotations of related grass 
reference-genome sequences. BUSCO provides a measure for the recall of highly conserved gene models. 
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of conservation. There was no evidence that any 
major rearrangements occurred since the A, B, 
and D subgenomes diverged ~5 million years ago 
(Fig. 1D), although collinearity between homeo- 
logs was disturbed by inversions occurring, on 
average, every 74.8 Mb, involving blocks of 10 
genes or more (mean gene number of 48.2 with 
a mean size of 10.5 Mb) (Fig. 1D and table S28). 
Macrosynteny was conserved across centromere 
(C) regions, but collinearity (microsynteny) broke 
down specifically in these recombination-free, 
gene-poor regions for all seven sets of homeol- 
ogous chromosomes (Fig. 1D, figs. S24 to S26, 
and table S29). Of the 113,653 homeologous genes, 
80% (90,232) were found organized in macro- 
synteny, that is, still present at their ancestral 
position (table $24). At the microsynteny scale, 
72% (82,308) of the homeologs were organized 
in collinear blocks, that is, intervals with a highly 
conserved gene order (Fig. 1D). A higher propor- 
tion of syntenic genes was found in the intersti- 
tial regions [short arm, R2a (18), 46% and long 
arm, R2b (78), 61%] than in the distal telomeric 
[short arm, R1 (/8), 39% and long arm, R3 (J8), 
51%] and centromere regions [C (78), 29%], and 
the interstitial compartments harbored larger 
syntenic blocks (figs. S27 and S28). The higher 
proportions of duplicated genes in distal-terminal 
regions (34 and 27% versus 13 to 15% in the other 


regions; fig. S29) exerted a strong influence on the 
decay of syntenic block size and contributed to the 
higher sequence variability in these regions. Over- 
all, distal chromosomal regions are the preferen- 
tial targets of meiotic recombination and the 
fastest evolving compartments. As such, they 
represent the genomic environment for creating 
sequence, hence allelic, diversity, providing the 
basis for adaptability to changing environments. 


Atlas of transcription reveals trait- 
associated gene co-regulation networks 


The gene annotation, coupled with identification 
of homeologs and paralogs in IWGSC RefSeq 
v1.0, provides a resource to study gene expression 
in genome-wide and subgenome contexts. A total 
of 850 RNA-seq samples derived from 32 tissues 
at different growth stages and/or challenged 
by different stress treatments were mapped to 
RefSeq Annotation v1.0 (Fig. 3A, database S1, and 
tables S30 to S32). Expression was observed for 
94,114 (84.9%) HC genes (fig. S30) and for 77,920 
(49.1%) LC genes, the latter showing lower ex- 
pression breadth and level [median six tissues; 
average 2.9 transcripts per million (tpm)] than 
the HC genes (median 20 tissues, average 8.2 tpm) 
(fig. S31). This correlated with the higher average 
methylation status of LC genes (figs. S32 and $33). 
A principal component analysis identified tissue 


Table 2. Relative proportions of the major elements of the wheat genome. Proportions of TEs 
are given as the percentage of sequences assigned to each superfamily relative to genome size. 
Abbreviations in parentheses under the headings “Class 1” and “Class 2” indicate transposon types. 
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(Fig. 3B), rather than growth stage or stress 
(fig. S34), as the main factor driving differential 
expression between samples, consistent with 
studies in other organisms (30-33). Of the total 
number of genes, 31.0% are expressed in more 
than 90% of tissues (average 16.9 tpm, =30 tissues), 
and 21.5% are expressed in 10% or fewer tissues 
(average 0.22 tpm, <3 tissues; fig. S31). 

Of the HC genes, 8231 showed tissue-exclusive 
expression (fig. S35). About half of these were 
associated with reproductive tissues (microspores, 
anther, and stigma or ovary), consistent with ob- 
servations in rice (34). The tissue-exclusive genes 
were enriched for response to extracellular stimuli 
and reproductive processes (database S2). By con- 
trast, 23,146 HC genes expressed across all 32 
tissues were enriched for biological processes 
associated with housekeeping functions such as 
protein translation and protein metabolic pro- 
cesses. Tissue-specific genes were shorter [1147 + 
8 base pairs (bp)], had fewer exons (2.76 + 0.3), 
and were expressed at lower levels (3.4 + 0.1 tpm) 
compared with ubiquitous genes (1429 + 7 bp, 
7.87 + 0.4 exons, and 17.9 + 0.4 tpm) (fig. S35). 

Genes located in distal regions R1 and R3 
(fig. S25 and table S29) showed lower expression 
breadth than those in the proximal regions (15.7 
and 20.7 tissues, respectively) (Fig. 3C and fig. S36). 
This correlated with enrichment of Gene Ontology 
(GO) slim terms such as “cell cycle,” “translation,” 
and “photosynthesis” for genes in the proximal 
regions, whereas genes enriched for “response to 
stress” and “external stimuli” were found in the 
highly recombinant distal R1 and R3 regions 
(database S3, fig. S36, and table S33). The expres- 
sion breadth pattern was also correlated with the 
distribution of the repressive H3K27me3 (trime- 
thylated histone H3 lysine 27) (Pearson 7 = —0.76, 
P< 2.2 x 10°) and with the active H3K36me3 
and H3K9ac (acetylated H3K9) (Pearson r = 0.9 
and 0.83, respectively; P < 2.2 x 101°) histone 
marks (fig. S37). 

Global patterns of coexpression (35) were deter- 
mined with a weighted gene coexpression network 
analysis (WGCNA) on 94,114 expressed HC genes. 
Of these genes, 58% (54,401) could be assigned to 
38 modules (Fig. 3D and database S4), and, con- 
sistent with the principal component analysis, 
tissues were the major driver of module identity 
(Fig. 3D and figs. S38 to S40). The analysis fo- 
cused initially on the 9009 triads (syntenic and 
nonsyntenic) with a 1:1:1 A:B:D relationship and 
for which all homeologs were assigned to a 
module. Of the triads, 16.4% had at least one 
homeolog in a divergent module, with the B 
homeolog most likely to be divergent (37.4% 
B-divergent versus 31.7% A-divergent and 30.9% 
D-divergent triads, x? test P = 0.007). However, 
the expression profiles of most (83.6%) of the triads 
were relatively consistent with all homeologs in 
the same (57.6%) or a closely related (26.0%) mod- 
ule. The proportion of homeologs found within the 
same module was higher than expected, pointing 
to a highly conserved expression pattern of homeo- 
logs across the 850 RNA-seq samples (Fig. 3E and 
table S34). Triads with at least one gene in a 
nonsyntenic position had a higher amount of 
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Fig. 3. Wheat atlas of transcription. (A) Schematic illustration of a mature 
wheat plant and high-level tissue definitions for “roots,” “leaves,” “spike,” and 
“grain” used in the further analysis. (B) Principal component (PC) analysis 
plots for similarity of overall transcription, with samples colored according to 
their high-level tissue of origin [as introduced in (A)]. The color key for 

tissue is shown at the bottom of the figure under (C). (©) Chromosomal 
distribution of the average expression breadth [number of tissues in which 
genes are expressed (total number of tissues, n = 32)]. The average (dark 
orange line) is calculated on the basis of a scaled position of each gene within 
the corresponding genomic compartment (blue, aqua, and light yellow 
background) across the 21 chromosomes (orange lines). (D) Heatmap 
illustrating the expression of a representative gene (eigengene) for the 38 
coexpression modules defined by WGCNA. Modules are represented as 
columns, with the dendrogram illustrating eigengene relatedness. Each row 
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represents one sample. Colored bars to the left indicate the high-level tissue of 
origin; the color key is shown at the bottom of the figure under (C). DESeq2- 
normalized expression levels are shown. Modules 1 and 5 (light green boxes) 
were most correlated with high-level leaf tissue, whereas modules 8 and 11 
(dark green boxes) were most correlated with spike. (E) Bar plot of module 
assignment (same, near, or distant) of homeologous triads and duplets in the 
WGCNA network. (F) Simplified flowering pathway in polyploid wheat. Genes 
are colored according to their assignment to leaf (light green)— or spike 
(dark green)-correlated modules. (G) Excerpt from phylogenetic tree for 
MADS transcription factors, including known Arabidopsis flowering regulators 
SEP1, SEP2, and SEP4 (black) (for the full phylogenetic tree, see fig. S38). 
Green branches represent wheat orthologs of modules 8 and 11, whereas 
purple branches are wheat orthologs assigned to other modules (O and 2). 
Gray branches indicate non-wheat genes. 
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assemblies (53, 54). Rows in the bottom panel heatmap (color scheme on z-score scale) indicate the fold expansion and contraction of gene families for the 
taxa and species included in the analysis [Oryza sativa (Osat), Sorghum bicolor (Sbic), Zea mays (Zmay), Brachypodium distachyon (Bdis), Hordeum vulgare 


(Hvull/2), Secale cereale (Scer), A. tauschii (Aetau), T. urartu (Tura), and wheat A (TraesA), B (TraesB), and D (TraesD) subgenomes]. (B) All enriched TO terms for 
the gene families depicted in (A). Overrepresented TO terms were found for expanded families in bread wheat (all subgenomes, red), the B subgenome (green), 
and the A lineage (T. urartu and A subgenome, blue) only, respectively. The x axis represents the percentage of genes annotated with the respective TO term 
that were contained in the gene set in question. The size of the bubbles corresponds to the P (—logio) significance of expansion. (©) Genomic distribution of 
gene families associated with adaptation to biotic (light and dark blue) or abiotic stress (light and dark pink), RNA metabolism in organelles and male fertility 
(orange), or end-use quality (light, medium, and dark green). Known positions of agronomically important genes and loci are indicated by red arrows and arrowheads 
to the left of the chromosome bars. Recombination rates are displayed as heatmaps in the chromosome bars [7.2 cM/Mb (light green) to O cM/Mb (black)]. 
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divergent expression patterns compared to syntenic 
triads (21.2 versus 16.2%, y” test P < 0.001) and 
fewer such triads shared all homeologs in the same 
module (48.7%) compared to syntenic triads (58.0%, 
chi-square test P = 0.009). Similar patterns were 
observed in the 1933 duplets that have a 1:1 re- 
lationship between only two homeologs (table 
S34). These results are consistent with syntenic 
homeologs showing similar expression patterns, 
whereas more dramatic changes in chromosome 
context associate with divergent expression and 
possible sub- or neofunctionalization. These trends 
were also found across diverse tissue-specific 
networks (23). 

To explore the potential of the WGCNA net- 
work for identifying previously uncharacterized 
pathways in wheat, a search was undertaken for 
modules containing known regulators of wheat 
flowering time [e.g., PPD (36) and FT (37); Fig. 
3F]. Genes belonging to this pathway were 
grouped into specific modules. The upstream 
genes (PHYB, PHYC, PPD1, ELF3, and VRN2) 
were present mainly in modules 1 and 5 and 
were most highly correlated with expression in 
leaf and shoot tissues (0.68 and 0.67, respectively; 
adjusted P < 1 x 10°1°°). By contrast, the inte- 
grating gene FT and downstream genes VRN1, 
FUL2, and FUL3 were found in modules 8 and 
11, most highly correlated with expression in 
spikes (0.69 and 0.65, respectively; adjusted P < 
1x 10°11; table $35). The MADS_II transcrip- 
tion factor family that is generally associated 
with the above pathways was examined more 
closely, with a focus on the gene tree OG0000041, 
which contains 54 of the 118 MADS_II genes in 
wheat. Twenty-four MADS_II genes from mod- 
ules 8 and 11 were identified within this gene 
tree, clustering into two main clades along with 
Arabidopsis and rice orthologs associated with 
floral patterning (fig. S41 and database S5). 
Within these clades, other MADS_II genes were 
found that were not in modules 8 or 11 (Fig. 3G), 
indicating a different pattern of coexpression. 
None of the 24 MADS_II genes had a simple 1:1 
ortholog in Arabidopsis, suggesting that some 
wheat orthologs function in flowering (those 
within modules 8 and 11), whereas others could 
have developed different functions, despite being 
phylogenetically closely related. Thus, these data 
provide a framework to identify and prioritize 
the most likely functional orthologs of known 
model system genes within polyploid wheat, to 
characterize them functionally (38), and to dis- 
sect genetic factors controlling important agro- 
nomic traits (39, 40). A more detailed analysis 
of tissue-specific and stress-related networks (23) 
provides a framework for defining quantitative 
variation and interactions between homeologs 
for many agronomic traits (41). 


Gene-family expansion and contraction 
with relevance to wheat traits 


Gene duplication and gene-family expansion are 
important mechanisms of evolution and environ- 
mental adaptation, as well as major contributors 
to phenotypic diversity (42, 43). In a phyloge- 
nomic comparative analysis, wheat gene-family 
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size and wheat-specific gene-family expansion 
and contraction were benchmarked against nine 
other grass genomes, including five closely related 
diploid Triticeae species (table S23 and figs. S13 


to S15 and S42). A total of 30,597 gene families 
(groups of orthologous genes traced to a last 
common ancestor in the evolutionary hierarchy 
of the compared taxa) were defined, with 26,080 
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Fig. 5. IWGSC RefSeq v1.0-guided dissection of SSt1 and TaAGL33. (A) The Lillian-Vesper 
population genetic map was anchored to IWGSC RefSeq v1.0 (left), and differentially expressed genes were 
identified between solid- and hollow-stemmed lines of hexaploid (bread) and tetraploid (durum) wheat 
(right). (B) Cross-sectioned stems of Lillian (solid) and Vesper (hollow) are shown as a phenotypic 
reference (top). Increased copy number of TraesCS3BO1G608800 [annotated as a DOF (DNA-binding 
one-zinc finger) transcription factor] is associated with stem phenotypic variation (bottom). (C) A high- 
throughput SNP marker tightly linked to TraesCS3BO1G608800 reliably discriminates solid- from hollow- 
stemmed wheat lines. Relative intensity of the fluorophores (FAM and HEX) used in KASPar analysis are 
shown. Vertical axis shows FAM signal: horizontal axis shows HEX signal. (D) Schematic of the three 
TaAGL33 proteins, showing the typical MADS, I, K, and C domains. Triangles indicate the position of the five 
introns that occur in all three homeologs. Bars indicate the position of single-guide RNAs designed for 
exons 2 and 3. Three T-DNA vectors—each containing the bar selectable marker gene, CRISPR nuclease, 
and one of three single-guide RNA sequences—were used for Agrobacterium-mediated wheat 
transformation, essentially as described earlier (55). Transgenic plants were obtained with edits at the 
targeted positions in all TaAGL33 homeologs. The putatively resulting protein sequence is displayed 
starting close to the edits, with wild-type amino acids (aa) in black font and amino acids resulting from the 
induced frame shifts in red font. * indicates premature termination codons. (E) Mean days to flowering 
(after 8 weeks of vernalization) for progeny of four homozygous edited plants (light gray bars) and the 
respective homozygous wild-type segregants (dark gray bars). Numbers in parentheses refer to the 
number of edited and wild-type plants examined, respectively. Error bars display SEM. Growth conditions 


were as described in (50). 
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families containing gene members from at least 
one of the three wheat subgenomes (tables S36 
to S39). Among the 8592 expanded wheat gene 
families (33% of all families), 6216 were expanded 
in all three A, B, and D subgenomes (24%; either 
shared with the wild ancestor or specific to bread 
wheat, Fig. 4A). Another 1109 were expanded 
in only one of the wheat subgenomes, and 2102 
gene families were expanded in either the A or 
the D genome lineages (Fig. 4A, fig. S43, and 
table S36). Overall, only 78 gene families were 
contracted in wheat. The number of gene fam- 
ilies that are only expanded in wheat may be 
overestimated owing to limited completeness 
of the draft progenitor wheat genome assem- 
blies used in this study (14) (table S39). Gene 
Ontology (GO; ontology of biomedical terms 
for the areas “cellular component,” “biological 
process,” and “molecular function”), Plant Ontol- 
ogy (PO; ontology terms describing anatomical 
structures and growth and developmental stages 
across Viridiplantae), and Plant Trait Ontology 
[TO; ontology of controlled vocabulary to de- 
scribe phenotypic traits and quantitative trait 
loci (QTLs) that were physically mapped to a 
gene in flowering plant species] analyses iden- 
tified 1169 distinct GO, PO, and TO terms (15% 
of all assigned terms) enriched in genes belonging 
to expanded wheat gene families (Fig. 4B and 
figs. S44 and S45). “A-subgenome” or “A-lineage” 
expanded gene families showed a bias for terms 
associated with seed formation [overrepresentation 
of the TO term “plant embryo morphology” 
(TO:0000064) and several seed, endosperm, and 


embryo-developmental GO terms] (fig. S46). Sim- 
ilarly, “B-subgenome” expanded gene families 
were enriched for TO terms related to plant 
vegetative growth and development (database 
S6 and fig. S47). Gene families that were ex- 
panded in all wheat subgenomes were enriched 
for 14 TO terms associated with yield-affecting 
morphological traits and five terms associated 
with fertility and abiotic-stress tolerance (Fig. 4B), 
which was also mirrored by enrichment for GO 
and PO terms associated with adaptation to 
abiotic stress (“salt stress” and “cold stress”) and 
grain yield and quality (“seed maturation,” “dor- 
mancy,” and “germination”). The relationship 
between the patterns of enriched TO, PO, and 
GO terms for expanded wheat gene families and 
key characteristics of wheat performance (figs. 
S45 to S51) provides a resource (database S6) 
to explore future QTL mapping and candidate 
gene identification for breeding. 

Many gene families with high relevance to 
wheat breeding and improvement were among 
the expanded group, and their genomic distri- 
bution was analyzed in greater detail (Fig. 4C 
and figs. S52 to S54). Disease resistance-related 
NLR (nucleotide-binding site leucine-rich repeat)- 
like loci and WAK (wall-associated receptor)-like 
genes were clustered in high numbers at the 
distal (R1 and R3) regions of all chromosome 
arms, with NLRs often co-localizing with known 
disease resistance loci (Fig. 4C). The restorer-of- 
fertility-like (RFL) subclade of P class penta- 
tricopeptide repeat (PPR) proteins, potentially 
of interest for hybrid wheat production, com- 


prised 207 genes, nearly threefold more per 
haploid subgenome than have been identified 
in any other plant genome analyzed to date 
(44, 45). They localized mainly as clusters of 
genes in regions on the group 1, 2, and 6 chro- 
mosomes, which carry fertility-restoration QTLs 
in wheat (Fig. 4C and fig. S54). Within the de- 
hydrin gene family, implicated with drought 
tolerance in plants, 25 genes that formed well- 
defined clusters on chromosomes 6A, 6B, and 
6D (figs. S53 and S55) showed early increased 
expression under severe drought stress. As the 
structural variation in the CBF genes of wheat 
is known to be associated with winter survival 
(46), the array of CBF paralogs at the Fr-2 locus 
(fig. S56) revealed by IWGSC RefSeq v1.0 pro- 
vides a basis for targeted allele mining for pre- 
viously uncharacterized CBF haplotypes from 
highly frost-tolerant wheat genetic resources. 
Lastly, high levels of expansion and variation 
in members of grain prolamin gene families 
[fig. S52 and (47)] either related to the response 
to heat stress or whose protein epitopes are 
associated with levels of celiac disease and food 
allergies (47) provide candidates for future selec- 
tion in breeding programs. From these few ex- 
amples, it is evident that flexibility in gene copy 
numbers within the wheat genome has contri- 
buted to the adaptability of wheat to produce 
high-quality grain in diverse climates and envi- 
ronments (48). Knowledge of the complex picture 
of the genome-wide distribution of gene fami- 
lies (Fig. 4C), which needs to be considered for 
selection in breeding programs in the context of 


Table 3. Groups of homeologous genes in wheat. Homeologous genes are 
“subgenome orthologs” and were inferred by species tree reconciliation in the 
respective gene family. Numbers include both HC and LC genes filtered for TEs 
(filtered gene set). Conserved subgenome-specific (orphan) genes are found 

only in one subgenome but have homologs in other plant genomes used in this 
study. This includes orphan outparalogs resulting from ancestral duplication 

events and conserved only in one of the subgenomes. Nonconserved orphans 


are either singletons or duplicated in the respective subgenome, but neither 
have obvious homologs in the other subgenomes or the other plant genomes 
studied. Microsynteny is defined as the conservation and collinearity of local 
gene ordering between orthologous chromosomal regions. Macrosynteny is 
defined as the conservation of chromosomal location and identity of genetic 
markers like homeologs but may include the occurrence of local inversions, 
insertions, or deletions. Additional data are presented in table S24. 
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Number in wheat Composition of Number of Number of Number of Total number of 
Homeologous group (A:B:D) is : : 
genome groups (%) genes inA genes inB genes in D genes 
allel 21,603 onl 21,603 21,603 21,603 64,809 


Other ratios 


3,270 8.3 

:1:1 in microsynteny 18,595 474 18,595 18,595 55,785 
ee ee a ne ee fete teatiie iti NOTRE AT oe ae We eran! Sees Sarre AGE oe 

:1:1 in macrosynteny 19,701 50.2 19,701 19,701 BoMOS 
See cane BH ies et areata ee Ba he ae as ee Shen 
Total in homeologous groups 39,238 100.0 38,680 37,212 113,653 
Conserved subgenome orphans 12,987 10,844 36,243 

N 


Nonconserved subgenome 
duplicated orphans 
Total (filtered) 


71 83 38 iy 


60,328 63,935 181,036 
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distribution of recombination and allelic diver- 
sity, can now be applied to wheat improvement 
strategies. This is especially true if “must-have 
traits” that are allocated in chromosomal com- 
partments with highly contrasting character- 
istics are fixed in repulsion or are found only in 
incompatible gene pools of the respective breed- 
ing germplasm. 


Rapid trait improvement using 
physically resolved markers 
and genome editing 


The selection and modification of genetic var- 
iation underlying agronomic traits in breeding 
programs is often complicated if phenotypic 
selection depends on the expression of multiple 
loci with quantitative effects that can be strongly 
influenced by the environment. This dilemma 
can be overcome if DNA markers in strong link- 
age disequilibrium with the phenotype are iden- 
tified through forward genetic approaches or if 
the underlying genes can be targeted through 
genome editing. The potential for IWGSC RefSeq 
v1.0, together with the detailed genome annota- 
tion, to accelerate the identification of potential 
candidate genes underlying important agronomic 
traits was exemplified for two targets. A forward 
genetics approach was used to fully resolve a 
QTL for stem solidness (SStZ) conferring resist- 
ance to drought stress and insect damage (49) 
that was disrupted in previous wheat assem- 
blies by a lack of scaffold ordering and anno- 
tation, partial assembly, and/or incomplete gene 
models (fig. S57 and tables S40 and S41). In 
IWGSC RefSeq v1.0, SS¢7 contains 160 HC genes 
(table S42), of which 26 were differentially ex- 
pressed (DESeq2, Benjamini-Hochberg adjusted 
P < 0.01) between wheat lines with contrasting 
phenotypes. One of the differentially expressed 
genes, TraesCS3BO01G608800, was present as a 
single copy in IWGSC RefSeq v1.0 but showed 
copy number variation associated with stem 
solidness in a diverse panel of hexaploid cul- 
tivars (Fig. 5A, fig. S58, and table S43). Using 
IWGSC RefSeq v1.0, we developed a diagnostic 
SNP marker physically linked to the copy num- 
ber variation that has been deployed to select 
for stem solidness in wheat breeding programs 
(Fig. 5B). 

Knowledge from model species can also be 
used to annotate genes and provide a route to 
trait enhancement through reverse genetics. The 
approach here targeted flowering time, which is 
important for crop adaptation to diverse envi- 
ronments and is well studied in model plants. Six 
wheat homologs of the FLOWERING LOCUS C 
(FLC) gene have been identified as having a 
role in the vernalization response, a critical pro- 
cess regulating flowering time (50). IWGSC RefSeq 
vLO was used to refine the annotation of these 
six sequences to identify four HC genes and 
then to design guide RNAs to specifically tar- 
get, with CRISPR-Cas9-based gene editing, one 
of these genes, TaAGL33, on all subgenomes 
[TraesCS3A01G435000 (A), TraesCS3B01G470000 
(B), and TraesCS3D01G428000 (D)] [Fig. 5C and 
(14)]. Editing was obtained at the targeted gene 
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and led to truncated proteins after the MADS 
box through small deletions and insertions (Fig. 
5D). Expression of all homeologs was high before 
vernalization, dropped during vernalization, and 
remained low post-vernalization, implying a role 
for this gene in flowering control. This expres- 
sion pattern was not strongly affected by the 
genome edits (fig. S59). Plants with the editing 
events in the D subgenome flowered 2 to 3 days 
earlier than controls (Fig. 5E). Further refine- 
ment should help to fully elucidate the impor- 
tance of the TaAGL33 gene for vernalization in 
monocots. These results exemplify how the 
IWGSC RefSeq v1.0 could accelerate the devel- 
opment of diagnostic markers and the design 
of targets for genome editing for traits relevant 
to breeding. 


Conclusions 


IWGSC RefSeq v1.0 is a resource that has the 
potential for disruptive innovation in wheat im- 
provement. By necessity, breeders work with 
the genome at the whole-chromosome level, as 
each new cross involves the modification of 
genome-wide gene networks that control the 
expression of complex traits such as yield. With 
the annotated and ordered reference genome 
sequence in place, researchers and breeders can 
now easily access sequence-level information to 
define changes in the genomes of lines in their 
programs. Although several hundred wheat QTLs 
have been published, only a small number of 
genes have been cloned and functionally char- 
acterized. IWGSC RefSeq v1.0 underpins imme- 
diate application by providing access to regulatory 
regions, and it will serve as the backbone to 
anchor all known QTLs to one common anno- 
tated reference. Combining this knowledge with 
the distribution of meiotic recombination fre- 
quency and genomic diversity will enable breeders 
to more efficiently tackle the challenges imposed 
by the need to balance the parallel selection 
processes for adaptation to biotic and abiotic 
stress, end-use quality, and yield improvement. 
Strategies can now be defined more precisely 
to bring desirable alleles into coupling phase, 
especially in less-recombinant regions of the 
wheat genome. Here the full potential of the 
newly available genome information may be 
realized through the implementation of DNA- 
marker platforms and targeted breeding tech- 
nologies, including genome editing (57). 


Methods summary 


Whole-genome sequencing of cultivar Chinese 
Spring by short-read sequencing-by-synthesis 
provided the data for de novo genome assembly 
and scaffolding with the software package 
DenovoMAGIC2. The assembly was superscaf- 
folded and anchored into 21 pseudomolecules 
with high-density genetic (POPSEQ) and physical 
(Hi-C and 21 chromosome-specific physical maps) 
mapping information and by integrating addition- 
al genomic resources. Validation of the assembly 
used independent genetic (de novo genotyping- 
by-sequencing maps) and physical mapping evi- 
dence (radiation hybrid maps, BioNano “optical 
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maps” for group 7 homeologous chromosomes). 
The genome assembly was annotated for genes, 
repetitive DNA, and other genomic features, and 
in-depth comparative analyses were carried out 
to analyze the distribution of genes, recombi- 
nation, position, and size of centromeres and 
the expansion and contraction of wheat gene 
families. An atlas of wheat gene transcription 
was built from an extensive panel of 850 inde- 
pendent transcriptome datasets and was then 
used to study gene coexpression networks. Fur- 
thermore, the assembly was used for the dis- 
section of an important stem-solidness QTL and 
to design targets for genome editing of genes 
implicated in flowering-time control in wheat. De- 
tailed methodological procedures are described 
in the supplementary materials. 
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INTRODUCTION: Polyploidy, arising from 
whole-genome duplication or interspecific hy- 
bridization, is ubiquitous across the plant and 
fungal kingdoms. The presence of highly related 
genes in polyploids, referred to as homoeologs, 
has been proposed to confer adaptive plasticity— 
for example, through neofunctionalization of du- 
plicated genes or tissue-specific expression. This 
plasticity has facilitated the domestication and 
adaptation of major polyploid crops (e.g., wheat, 
cotton, and coffee). However, despite its likely 
importance, we have a limited understanding 
of the effect of polyploidy on gene expression 
and the extent to which homoeologs are simi- 
lar or different in their expression patterns 
across development and tissues. 


Balanced Triads (70%) 


RATIONALE: Bread wheat is a polyploid de- 
rived from the hybridizations between three 
distinct diploid species and is an informative 
system for analyzing the effects of recent poly- 
ploidy on gene expression. Understanding the 
coordination of homoeologs and identifying the 
mechanisms associated with these processes 
should help define strategies to improve trait 
biology in a crop that provides more than 20% 
of the protein and caloric intake of humans. 


RESULTS: Here we leverage 850 wheat RNA- 
sequencing samples, alongside the annotated 
genome, to determine the similarities and dif- 
ferences between homoeolog expression across 
arange of tissues, developmental stages, and 


Non-Balanced Triads (30%) 


B dominant (2%) A suppressed (8%) 
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Homoeolog expression patterns in polyploid wheat. Seventy percent of triads (A, B, 

and D homoeologs) show balanced expression among homoeologs and are ubiquitously 
expressed (left), whereas ~30% show nonbalanced expression and are more tissue-specific 
(right; symbolized by three exemplar tissues). Variation in promoter elements and non- 
synonymous substitution rates distinguish between individual triads with stable relative 
expression across tissues and triads with more inter-tissue variation (tissue-dynamic triads). 
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cultivars. On average, ~30% of wheat homoeo- 
log triads (composed of A, B, and D genome 
copies) showed nonbalanced expression pat- 
terns, with higher or lower expression from a 
single homoeolog with respect to the other two. 
These differences between homoeologs were 
associated with epigenetic changes affecting 
DNA methylation and histone modifications. 
Although nonbalanced homoeolog expression 
could be partially predicted by expression in 
diploid ancestors, large changes in relative ho- 
moeolog expression were observed owing to 
polyploidization. 

Our results suggest that the transposable 
elements in promoters relate more closely to 
the variation in the relative expression of ho- 
moeologs across tissues than to a ubiquitous 
effect across all tissues. We found that homoeo- 
logs with the highest inter-tissue variation 

had promoters with more 


frequent transposable ele- 


Read the full article | Ment insertions and more 
at http://dx.doi. varied cis-regulatory ele- 
org/10.1126/ ments than homoeologs 


science.aar6089 


that were stable across 
tissues. We also identified 
expression asymmetry along wheat chromo- 
somes. Homoeologs with the largest inter-tissue, 
inter-cultivar, and coding sequence variation 
were most often located in the highly recom- 
binogenic distal ends of chromosomes. These 
transcriptionally dynamic homoeologs are under 
more relaxed selection pressure, potentially rep- 
resenting the first steps toward functional in- 
novation through neo- or subfunctionalization. 
We generated tissue- and stress-specific co- 
expression networks that reveal extensive coor- 
dination of homoeolog expression throughout 
development. These networks, alongside de- 
tailed gene expression atlases (www.wheat- 
expression.com and http://bar.utoronto.ca), 
lay the groundwork to identify candidate genes 
influencing agronomic traits in wheat. 


CONCLUSION: This study provides detailed 
insights into the transcriptional landscape of 
bread wheat, an evolutionarily young poly- 
ploid. Our work shows that homoeolog ex- 
pression patterns in bread wheat have been 
shaped by polyploidy and are associated with 
both epigenetic modifications and variation 
in transposable elements within promoters of 
homoeologous genes. The extensive datasets 
and analyses presented here provide a frame- 
work that can help researchers and breeders 
develop strategies to improve crops by mani- 
pulating individual or multiple homoeologs to 
modulate trait responses. 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
{Corresponding author. Email: cristobal.uauy@jic.ac.uk 
(C.U.); philippa.borrill@jic.ac.uk (P.B.) 
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The coordinated expression of highly related homoeologous genes in polyploid species 
underlies the phenotypes of many of the world’s major crops. Here we combine extensive 
gene expression datasets to produce a comprehensive, genome-wide analysis of 
homoeolog expression patterns in hexaploid bread wheat. Bias in homoeolog expression 
varies between tissues, with ~30% of wheat homoeologs showing nonbalanced expression. 
We found expression asymmetries along wheat chromosomes, with homoeologs showing 
the largest inter-tissue, inter-cultivar, and coding sequence variation, most often located 
in high-recombination distal ends of chromosomes. These transcriptionally dynamic 
genes potentially represent the first steps toward neo- or subfunctionalization of wheat 
homoeologs. Coexpression networks reveal extensive coordination of homoeologs 
throughout development and, alongside a detailed expression atlas, provide a framework 
to target candidate genes underpinning agronomic traits in wheat. 


olyploidy arises from whole-genome dup- 

lication or interspecific hybridization and 

is ubiquitous in eukaryotic plant and fun- 

gal lineages. Polyploidy has been proposed 

to confer adaptive plasticity, thereby shap- 
ing the evolution of plants, fungi, and, to a lesser 
degree, animals (J, 2). This plasticity has facili- 
tated the domestication and adaptation of sev- 
eral major crop species (3), including hexaploid 
bread wheat (Triticum aestivum; AABBDD sub- 
genome), which is derived from relatively recent 
interspecific hybridizations between three dif- 
ferent diploid species. In such polyploids, gene 
duplication alters the transcriptional landscape 
(4) by providing additional flexibility to adapt 
and evolve new patterns of gene expression for 
homoeologous gene copies (5). This flexibility 
has been suggested to be an important mecha- 
nism for controlling adaptive traits (6, 7)—for 
example, through neofunctionalization of dupli- 


cated genes (8) or tissue-specific expression (9). 
However, despite the likely importance of poly- 
ploidy in affecting gene expression, we have a 
limited understanding of the extent to which 
homoeologs resemble or differ from each other 
in their expression patterns, the spatiotemporal 
dynamics of these relationships, and how epi- 
static interactions between individual homoe- 
ologs affect biological traits. The new genomic 
resources available for wheat (0), along with 
its meiotic stability (17) and syntenic gene order 
(12), make it a particularly informative system 
for gaining insight into the effects of recent 
polyploidy on gene expression. 

In this study, we leveraged available RNA 
sequencing (RNA-seq) data (529 samples from 
28 studies) and added 321 samples to explore 
global gene expression in hexaploid wheat across 
a diverse range of tissues, developmental stages, 
cultivars, and environmental conditions (13). We 


organized these sets of RNA-seq samples into 
partially overlapping datasets from (i) a single 
developmental time course experiment (n = 209 
samples), (ii) the reference accession Chinese 
Spring (CS) under nonstress conditions (n = 
123 samples), (iii) four main tissue types under 
nonstress conditions (7 = 537 samples), and (iv) 
seedling samples from abiotic (m = 50) and biotic 
(n = 163) stress experiments including controls 
(table S1). These datasets, alongside a complete 
and annotated genome and transcriptome (10), 
provide an opportunity to conduct homoeolog- 
specific transcriptome profiling and to generate 
gene regulatory networks to better understand 
the spatiotemporal coordination of individual ho- 
moeologs underlying trait biology on a genome- 
wide scale. 


A developmental gene expression atlas 
in polyploid wheat 


We first assessed expression patterns through a 
developmental time course of the commercial 
wheat cultivar Azhurnaya, including 209 RNA- 
seq samples representing 22 tissue types from 
grain, root, leaf, and spike samples across mul- 
tiple time points (Fig. 1). We quantified expres- 
sion using pseudoalignment of RNA-seq reads to 
the RefSeqv1.0 transcriptome, as implemented 
in kallisto (14), which accurately quantifies reads 
in a homoeolog-specific manner in polyploid 
wheat (13, 15) (figs. S1 and S2). We found evi- 
dence of expression for 83,741 (75.6% of 110,790) 
high-confidence genes, on the basis of expression 
of >0.5 transcripts per million (TPM) in at least 
one of the 22 tissue types, and we conducted 
complexity (table S2) and differential expression 
analyses (fig. S3). Tissue type distinguished sam- 
ples across development (fig. S4) (13), consistent 
with observations in other plant and animal spe- 
cies (16, 17). Within similar tissue types, subgenome 
of origin also influenced expression patterns, 
consistent with previous results in wheat grain 
samples (18). This gene expression atlas provides 
avaluable resource for breeders and researchers 
to query for and analyze their genes of interest 
through www.wheat-expression.com (15) and the 
Wheat eFP Browser at http://bar.utoronto.ca/ 
efp_wheat/cgi-bin/efpWeb.cgi (fig. S5) (19). 


Homoeolog expression patterns 


In polyploid wheat, quantitative variation for 
many agronomic traits is modulated by genetic 
interactions between multiple sets of homoeo- 
logs in the A, B, and D subgenomes (20). These 
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interactions range from buffering effects ob- 
served when gene homoeologs are function- 
ally redundant (27) to dominance effects where 
variation in a single homoeolog can lead to dom- 
inant phenotypes (22). Understanding how these 
interactions influence gene expression will help 
inform strategies to improve crops by target- 
ing and manipulating individual or multiple 
homoeologs to quantitatively modulate trait re- 
sponses (20). 

To determine patterns of homoeolog expres- 
sion, we analyzed 123 RNA-seq samples repre- 
senting 15 tissues under nonstress conditions 
(table S1) from CS. This was the same accession 
used to generate the reference genome (10); 
thus, cultivar-specific polymorphisms were ex- 
cluded from our analysis. We found evidence of 
expression for 82,567 (74.5%) high-confidence 
genes, consistent with the developmental time 
course of the cultivar Azhurnaya. We focused 
on 53,259 genes that had a 1:1:1 correspon- 
dence across the three homoeologous subgenomes, 
referred to as triads, and a summed expression 
of >0.5 TPM across the triad (64.5% of expressed 
genes, 96.1% of all triads; table S3). The majority 
of these expressed triads (94.3%) were in an- 
cestral (i.e., syntenic) physical positions in at 


coleoptile 
seedling stage 
shoot apical meristem 


3rd leaf blade | 


1st leaf stage 


1st leaf blade 


ist leaf sheath | 


3rd leaf stage 


3rd leaf sheath 


least two of the three subgenomes [50,238 genes 
corresponding to 16,746 syntenic triads (J0)], 
whereas 5.7% (1007 triads) had all genes in non- 
syntenic positions and are thus referred to as 
nonsyntenic triads. For each of the 17,753 ex- 
pressed triads, we standardized the relative ex- 
pression of the A, B, and D subgenome homoeologs 
(fig. S6) so that the sum was 1.0 in each individ- 
ual tissue. In this way, the relative abundance 
of homoeolog expression is comparable within 
triads, as well as across tissues, allowing the 
study of homoeolog expression bias (23). 

We performed a global analysis combining 
data across all 15 tissues and focused on the 
16,746 syntenic triads, but we discuss patterns 
in nonsyntenic triads where relevant. We found 
that the D subgenome had a subtly yet significantly 
higher relative abundance (33.65%) than the B 
(33.29%) and A (33.06%) subgenomes (Kruskal-Wallis 
P < 0.001; tables S4 and S5). The homoeolog ex- 
pression bias of the D subgenome is unlikely to 
reflect technical issues (fig. S2) and was found 
in 11 of the 15 tissues, was consistent across 
multiple expression abundance cutoffs, and was 
also significant in the developmental time course 
(in all 22 tissues) of the cultivar Azhurnaya [figs. 
S7to S9 and table S5 (13)]. This effect, however, 
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is subtle when compared with the homoeolog 
expression bias observed in evolutionarily older 
polyploid crops such as cotton, in which genome 
doubling occurred at an earlier time point (24). 

The relative expression of each homoeolog de- 
termined a triad’s position in the ternary plot for 
the global analysis (Fig. 2A), as well as for analy- 
ses of individual tissues (figs. S6 and S10). From 
these plots, we defined seven homoeolog expres- 
sion bias categories (13): a balanced category, 
with similar relative abundance of transcripts 
from the three homoeologs, and six homoeolog- 
dominant or homoeolog-suppressed categories, 
classified on the basis of the higher or lower 
abundance of transcripts from a single homoeo- 
log with respect to those from the other two 
(Fig. 2A). Most syntenic triads (72.5%) were as- 
signed to the balanced category within each tis- 
sue, with balanced triads ranging from 62.6% in 
the stigma and ovary to 78.9% in roots (Fig. 2B 
and table S6). Triads with single-homoeolog 
dominance were infrequent (7.1%; range among 
tissues, 4.7 to 11.3%), whereas syntenic triads 
classified as single-homoeolog-suppressed were 
more common (20.5%; range, 16.3 to 27.1%; Fig. 2B). 
These patterns shifted significantly in the 1007 
nonsyntenic triads, which had fewer balanced 
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Fig. 1. Developmental time course of bread wheat. Shown is a schematic overview of tissues sampled for the RNA-seq expression atlas across 
multiple growth stages (labeled in blue). Details of all samples are provided in table S1. 
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triads (58.9%) and a higher proportion of domi- 
nant (14.5%) and suppressed (26.6%) triads across 
tissues (x? P < 0.001; tables S7 and S8). Across 
tissues, no differences were observed in the fre- 
quency of single-homoeolog dominance between 


subgenomes (tables S6 and S7). However, across 
all 15 tissues, D-homoeolog suppression was sig- 
nificantly less frequent (5.7%) than either A- or 
B-homoeolog suppression (7.5 and 7.2%, respec- 
tively; Kruskal-Wallis P < 0.05), and this pat- 


tern was also observed in nonsyntenic triads 
and the developmental time course (tables S6 
to $8). This pattern explains in part the subtle 
homoeolog expression bias observed for the 
D subgenome relative to those observed for the 
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Fig. 2. Homoeolog expression bias in syntenic homoeolog triads. 
(A) Ternary plot showing relative expression abundance of 16,746 
syntenic triads (50,238 genes) in hexaploid wheat in the combined 
analysis of 15 tissues from Chinese Spring. Each circle represents 

a gene triad with an A, B, and D coordinate consisting of the relative 
contribution of each homoeolog to the overall triad expression (an 
example is shown on the top left). Triads in vertices correspond to 
single-subgenome-—dominant categories, whereas triads close to edges 
and between vertices correspond to suppressed categories. Balanced 
triads are shown in gray. Box plots indicate the relative contribution 

of each subgenome based on triad assignment to the seven categories. 
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(B) Proportion of triads in each category of homoeolog expression bias 
across the 15 tissues (excl, excluding). (C) Box plot of absolute TPM 
expression abundance for each subgenome from the seven categories. 
(D) Number of tissues in which homoeolog-suppressed (brown), 
homoeolog-dominant (teal), and balanced (gray) triads are expressed. 
(E) Metagene profile for histone H3K27me3 marks from —2 kb upstream 
of the ATG to +2 kb downstream of the stop codon (normalized for 


gene length) for balanced 


triads (gray), dominant triads separated 


into dominant (teal) and nondominant (pale blue) homoeologs, and 
suppressed triads separated into suppressed (tan) and nonsuppressed 


(brown) homoeologs. 
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A- and B-subgenome homoeologs. This observa- 
tion is consistent with the lower distribution 
of repressive H3K27me3 (histone H3 lysine 27 
trimethylation) histone marks across the gene body 
of D-subgenome homoeologs compared with those 
of the A- and B-subgenome homoeologs (fig. S11). 

Genes from syntenic triads in the balanced 
category were expressed across a wider range 
of tissues and had higher absolute transcript 
abundance, on a per-subgenome basis (12.2 tissues; 
median, 4.03 TPM), than genes in the suppressed 
(9.1 tissues; median, 1.51 TPM) or dominant 
(6.9 tissues; median, 0.57 TPM) categories (two- 
sample Kolmogorov-Smirnov test, P < 0.001; 
Fig. 2, C and D, and tables S9 and S10). The ab- 
solute transcript abundance data show that 
dominant triads are not the result of an overall 
increase in expression of a single homoeolog, 
but rather result from the relatively lower ex- 
pression of the two other homoeologs. 

To determine if the differences among homoeo- 
logs are a consequence of polyploidization, we 
analyzed RNA-seq data from diploid and tet- 
raploid progenitor species and newly created 
synthetic hexaploid wheat (SHW) lines (25). 
We found that 67.5% of nonbalanced triads in 
modern-day wheat have a different homoeolog 
expression bias category than that observed in 
SHW, with all three subgenomes being equally 
affected (table S11 and figs. S12 to S14). Likewise, 
47.1% of nonbalanced triads in SHW are ina 
different category than would be expected on the 
basis of the progenitor species, with D-subgenome 
homoeologs most strongly influenced (13) (table 
$12 and fig. S15). These results suggest that the 
polyploid context and the polyploidization pro- 
cess itself affect the relative expression of ho- 
moeologs compared with the baseline expression 
in the progenitor species (13), which has also 
been observed during the evolution of polyploid 
cotton (26) and monkeyflower (27). 

We hypothesized that epigenetic mechanisms 
might be associated with differences in homoe- 
olog expression patterns. To test this, we exam- 
ined the associations of transposable elements 
(TEs), DNA methylation, and histone modifica- 
tions with the relative expression of triads in 
leaves of CS. We found no clear relationship be- 
tween the presence of TEs in promoter regions 
and altered expression patterns between ho- 
moeologs in dominant and suppressed triads 
(Tukey’s Honestly Significant Difference P > 
0.6; fig. S16 and table S13) (13). However, we 
identified significant differences in gene-body 
DNA methylation and histone modifications 
among homoeologs (13). 

Gene-body CG methylation is widely con- 
served in angiosperms, although its functional 
significance is currently under debate (28, 29), 
given that two angiosperm species lack this epi- 
genetic mark altogether (30). We found higher 
gene-body CG methylation in constitutively ex- 
pressed triads than in more tissue-specific triads 
(balanced > suppressed > dominant; fig. S17). 
Within the nonbalanced triads, homoeologs with 
higher expression had higher CG methylation 
than their corresponding nondominant and sup- 
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pressed homoeologs (Mann-Whitney P < 0.001; 
fig S17). These results are consistent with gene- 
body CG methylation associated with housekeep- 
ing genes and its suggested role in homoeostatic 
gene expression (29). Similarly, the more highly 
expressed homoeologs within nonbalanced tri- 
ads had higher active (H3K36me3 and H3K9ac; 
ac, acetylation) histone marks and lower re- 
pressive (H3K27me3) histone marks in the gene 
body (Mann-Whitney P < 0.001; fig. S11). For 
H3K27me3, these differences were not limited 
strictly to the gene body but extended into the 
upstream and downstream regions for both 
dominant and suppressed triads (Fig. 2E), con- 
sistent with the tight association of H3K27me3 
with inactive gene promoters (37). These results 
suggest that epigenetic status in gene bodies, as 
well as upstream and downstream regions, is 
associated with homoeolog expression bias in 
polyploid wheat, consistent with results in mon- 
keyflower showing changes in DNA methylation 
upon polyploidization (27). 

Breeders rely on recombination to generate 
new combinations of haplotypes for improving 
cultivars. In wheat, chromosome position strong- 
ly influences recombination rates, with relatively 
low recombination rates in the interstitial and 
proximal regions (R2a, C, and R2b genomic com- 
partments) but markedly higher rates toward 
the distal ends of the chromosomes (R1 and R3 
genomic compartments) (32). In our analyses, 
syntenic triads in the balanced category were 
overrepresented in the low-recombination re- 
gions (R2 and C), which have higher levels of ac- 
tive histone marks (H3K36me3 and H3K9ac) (10), 
consistent with the higher expression of balanced 
triads. Homoeolog-dominant and homoeolog- 
suppressed triads were overrepresented toward 
the high-recombination distal ends of chromo- 
somes (R1 and R3; x” P < 0.001; table $14), which 
have higher levels of repressive (H3K27me3) his- 
tone marks (JO), consistent with the lower expres- 
sion of dominant and suppressed triads. This 
pattern was also observed in the developmental 
time course of the cultivar Azhurnaya. However, 
when comparing the CS and Azhurnaya cultivars 
(nine tissues in common), we found that 84.5% 
of genes in the R2 and C regions had the same 
expression category between cultivars, whereas 
only 72.2% of genes in the R1 and R3 regions did 
so (? P< 0.001; table S15). These differences in 
homoeolog expression bias across cultivars have 
important implications for breeding because they 
suggest that through genetic crosses, breeders 
not only generate new combinations of haplo- 
types with differential expression of alleles, but 
also rearrange and select for homoeolog expres- 
sion bias between cultivars. 


Variation of triad expression patterns 

Polyploidy may confer phenotypic plasticity by 
allowing homoeologs to be expressed differently 
across tissues and/or environmental conditions 
(8). Our analyses above provide a static overview 
of the relative homoeolog expression bias in in- 
dividual tissues. Therefore, we explored whether 
syntenic triads retain their homoeolog expres- 
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sion bias category across the 15 tissues (table 
S16). We found that 83.6% of balanced triads 
remained balanced in each of the 15 individ- 
ual tissues, whereas dominant and suppressed 
triads tended to be more variable across tissues, 
with only 73.4 and 62.2%, respectively, staying 
within their global dominance group across all 
15 tissues (Fig. 3A). Dominant and suppressed 
triads shifted most often to adjacent categories 
(16 to 20%) in the ternary plots and in few cases 
(<3.0%) changed to opposite categories (fig. S18). 
These patterns were also observed in the devel- 
opmental time course (table S16). These data 
show that across tissues, triads most often re- 
mained consistent in their homoeolog expres- 
sion bias classification, a phenomenon also seen 
across seven tissues in allotetraploid Tragopogon 
mirus (33). 

To complement this analysis, we determined 
the variation in behavior of each triad within the 
ternary plot across the 15 tissues by calculating 
the mean distance between the triad’s position 
in each tissue and its global average position (13) 
(fig. S19). This generated a distribution of mean 
distances (Fig. 3B); we focused on the 10% most 
stable triads (defined as those having the short- 
est mean distances across tissues) and the 10% 
most dynamic triads (largest mean distances) 
(Fig. 3, B to D). Stable triads were expressed more 
highly than dynamic triads (median, 8.2 versus 
3.2 TPM; P < 0.001) and had a higher expression 
breadth, being expressed across almost all sam- 
ples, whereas dynamic triads were more tissue- 
specific (P < 0.001) (Fig. 3E and table S17). Stable 
triads were enriched for high-level gene ontology 
(GO-slim) terms associated with housekeeping 
processes (e.g. translation and cell cycle), whereas 
dynamic triads were enriched for defense and 
external stimuli responses and secondary meta- 
bolic processes, functions that more frequently 
determine differences in individual fitness (table 
$18) (6). In the global analysis, stable triads were 
significantly enriched for the balanced category 
(94.2%), whereas dynamic triads were almost 
equally spread between suppressed (37.9%), dom- 
inant (30.5%), and balanced (31.6%) categories 
(y? P < 0.001; Fig. 3F and table $19). This pat- 
tern is consistent with stable triads being most 
frequently located in proximal regions (C), whereas 
dynamic triads tend to locate in distal ends of 
chromosomes (R1 and R3) (y” P < 0.001; table $20). 
These results demonstrate expression asymmetry 
across wheat chromosomes, whereby the high- 
recombination distal ends of chromosomes have 
triads that exhibit higher homoeolog expression 
bias, are more dynamic across tissues, and have 
higher expression variation between cultivars 
than triads in the low-recombination proximal 
regions (Fig. 3G). This asymmetry is also re- 
flected in the contrasting distributions of histone 
marks and DNA methylation along chromosomes 
(10). The difference in epigenetic marks identi- 
fied in leaves makes it tempting to speculate that 
epigenetic marks may also be associated with 
triad expression variation across multiple tissues. 

We next investigated if divergence in spatial 
expression patterns (as measured by the mean 
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distance statistic described above) was coupled 
with 5’ promoter and protein sequence diver- 
gence in syntenic triads (13). The 1.5-kilobase 
(kb) promoters of dynamic triads more frequently 
contained TEs (88.3 versus 79.2%), which were 
closer to the translation start site (1113 versus 
1234 bp away) but shorter (median, 220 versus 
259 bp), than those in stable triads, leading to 
equivalent TE densities (all comparisons, Kruskal- 
Wallis P < 0.001; fig. S20 and table S13). These 


closer, more frequent, and shorter TEs could 
potentially act as novel cis-regulatory elements 
(34) or influence epigenetic marks (35). These 
results indicate that the promoter TE landscape 
relates more closely to the variation in the rela- 
tive expression of homoeologs across tissues than 
to a ubiquitous effect across all tissues (table S13). 
Although only subtle differences in sequence iden- 
tity were found between stable and dynamic 
triads (85.5 versus 85.0%; P = 0.045) (Fig. 3H 


and table $21), dynamic triads had fewer con- 
served transcription factor (TF) binding site 
motifs across the three homoeologs (37% fewer; 
P< 0.001; fig. S21). Across coding sequences, 
we showed a stepwise decrease in conservation 
of both the nucleotide and protein identities from 
stable (average, 97.2% coding sequence and 97.3% 
protein) to dynamic (95.0 and 93.4%) triads (both 
P < 0.001; table S21). We compared nonsynony- 
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Fig. 3. Variation of triad expression patterns. (A) Variation of 
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global analysis) across 15 tissues. (B) Distribution of mean distance 
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rates between homoeologs and observed that 
dynamic triads had significantly higher K,/K, 
than stable triads (0.33 versus 0.21; Mann Whitney 
P< 0.001; Fig. 3H and table $22). This higher 
ratio suggests that triads with greater divergence 
in spatial expression patterns are under more 
relaxed selection pressure, as seen for duplicated 
genes in humans (9), but not in soybean (36) and 
carp (37). This conclusion is supported by the ob- 
servation that nonsyntenic triads, which had greater 
expression divergence (10.5% larger mean dis- 
tance; Mann-Whitney P < 0.001), also had sig- 
nificantly higher K,/K, (0.42; Mann-Whitney 
P< 0.001) compared with syntenic triads (table 
$22). The above relationships were consistent 
when using different percentage cutoffs to define 
stable and dynamic triads (5 and 25%), as well as 
in the developmental time course of the cultivar 
Azhurnaya (tables S21 and $22). These results 
show positive coupling of divergence in spatial 
expression patterns with divergence in TE and 
cis-regulatory elements in promoters and se- 
quence divergence in coding sequence among 
wheat homoeologs. It is possible that diver- 
gence in spatial expression patterns, alongside 
relaxation of selection pressure, can lead to func- 
tional innovation through homoeolog neo- or 
subfunctionalization. 


Coordinated expression of 
homoeolog triads 


Our analyses provide a framework to describe 
the relative expression of individual homoeologs 
between discrete triads in space and time. To 
understand how this coordination of homoeo- 
log spatiotemporal expression may influence 
biological processes, we developed a series of 
coexpression networks to provide insight into 
tissue-specific developmental and stress-related 
processes. 

We constructed four separate tissue-specific 
coexpression networks from nonstress RNA-seq 
samples from grain (n = 119 samples), leaf (7 = 
245), root (n = 45), and spike (7 = 128), using all 
genes expressed at more than 0.5 TPM in the 
given tissue (73). These networks were com- 
posed of 51 to 78 modules and contained 42.3 
to 88.0% of all expressed genes in each tissue 
(fig. S22, table S23, and data S1). We found that 
across all tissue networks, homoeologs from 
37.4% of the syntenic triads were in the same 
coexpression module, suggesting a highly coor- 
dinated expression pattern for these triads (y? P < 
0.0001 with respect to random triads; table S24). 
However, the majority of triads (62.6%) had at 
least one homoeolog outside the same module. 

To quantify whether homoeologs outside the 
module had similar or divergent expression pat- 
terns, we calculated a threshold based on the 
pairwise distance between homoeologs (13). We 
found that 29.6% of syntenic triads had a diver- 
gent pattern, wherein the expression of at least 
one homoeolog exceeded the distance threshold 
in the tissue network (Fig. 4A and fig. $23). 
Conversely, 33% of triads had a similar pattern, 
wherein all pairwise distances between homoeo- 
logs were lower than the threshold, suggesting a 
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Fig. 4. Homoeolog coexpression patterns in tissue networks. (A) Triad assignment to same, 
similar, and divergent modules in tissue coexpression networks. (B) Stable, middle, and dynamic 
triad assignment to same, similar, and divergent modules in the root network. (C) Neighbor-joining 
phylogenetic tree of homologs for the Arabidopsis MADS_I| gene AGL21. Wheat orthologs from 
chromosome group 2 are assigned to the root-specific module 61 (red), whereas chromosome group 
6 orthologs are assigned to modules 1 and 13 (blue and purple). 


subtler variation in a single homoeolog. These 
values showed significant variation between tis- 
sue networks, ranging from 7% divergent triads 
in the leaf network to more than 38% divergent 
triads in the root and grain networks (Fig. 4A). 
Nonsyntenic triads had a higher proportion of 
divergent triads in all tissue networks compared 
with syntenic triads (mean, 35.1 versus 29.6%; 
¢ P < 0.001; table S24). Using the same crite- 
rion as before (triad mean distance between tis- 
sues and global average position), we identified 
the 10% most stable and dynamic syntenic triads 
for each tissue-specific network. We found that 
dynamic triads were more frequently in diver- 
gent modules than stable triads for all four tis- 
sue networks (P < 0.001; Fig. 4B and fig. S24). 
These results are consistent with the homoeo- 
log expression bias analyses and support the 
idea that although many triads are expressed 
in a coordinated spatiotemporal pattern (with 
the same or similar profile), almost 30% of 
syntenic and 35% of nonsyntenic triads have 
a divergent expression profile. Transcriptional 
divergence occurs both immediately upon poly- 
ploidization and after polyploidization (figs. 
$12 to S14) and may represent initial steps 
toward neo- or subfunctionalization of wheat 
homoeologs. 


Exploiting development and stress 
networks for biological discovery 


To explore the potential for biological discovery, 
we first compared modules between networks to 
identify tissue-specific gene networks. Across the 
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four networks, 73.2% of modules had significant 
overlap (Fisher’s exact test, P < 0.05) with mod- 
ules in all four networks, with the root having the 
fewest conserved modules (61.1%) and the spike 
having the most (86.2%) (data S2). In the root, 
there were three modules that were not found 
in any other tissue, with the largest of these (root 
module 61; 82 genes) enriched for root-related 
plant ontology (PO) terms (e.g., root procambium, 
P = 3.3 x 10°, and central root cap of primary 
root, P = 4.5 x 10°; table $25). We hypothesized 
that genes encoding TFs controlling processes re- 
lated to these PO terms would also be coexpressed 
within module 61. We found that four of the 10 
genes encoding TFs in this root-specific module 
had known functions related to root development 
in Arabidopsis or rice (38, 39) (table S26). Three 
of these TFs belonged to one homoeolog triad in 
the MADS_II family, and one of their Arabidopsis 
orthologs (AGL2/) has been shown to regulate 
lateral root development through auxin accu- 
mulation (39). To understand the target genes 
of these TFs in wheat, we conducted a comple- 
mentary network analysis using genie3, which 
predicted target genes of TFs across all 850 sam- 
ples (13). Target genes of the three TFs were 
enriched for cell wall processes and lignifica- 
tion, consistent with their putative role in the 
differentiation zone where lateral roots emerge 
(tables S27 and S28). Closely related paralogs on 
chromosome group 6 in wheat were not located 
in root module 61; rather, they were in modules 1 
and 13 (Fig. 4C). These modules were conserved 
in all other tissue networks, implying a more 
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general function for genes within them. Sup- 
porting this hypothesis, the rice ortholog of the 
chromosome 6 paralogs (OsMADS57) has been 
shown to play a role in tillering (40). 

A key challenge for wheat breeding is the 
selection of cultivars with tolerance to multiple 
stresses. Therefore, we focused on stress re- 
sponses in seedlings and young vegetative plants, 
for which 10 independent studies with 12 dis- 
tinct abiotic and disease stresses were available 
(table S1). We constructed gene coexpression 
networks for abiotic and disease stresses sepa- 
rately, including control samples from the same 
studies to allow for links between disease status 
and gene expression (13). We integrated the two 
networks to identify modules that might be com- 
mon to both abiotic and disease responses. We 
found 84 pairs of modules between the two net- 
works that had significantly overlapping gene 
content and were significantly correlated with 
both an abiotic and a disease stress (tables S29 
to S31). The most significant overlap was be- 
tween disease module 12 and abiotic module 2 
(P =1.3 x 10°), which shared 355 genes (Fig. 5A). 
These two modules had similar enrichment for 
GO-slim terms relating to signal transduction 
and response to stimulus (table S32), suggest- 
ing that they might perform similar biological 
functions. 

Among the 355 shared genes, there were 16 
encoding TFs, six of which have orthologs in rice 
or Arabidopsis with proven roles in abiotic or 
disease stress, and a further three have orthologs 
differentially expressed during stress in these 
species (table S33). Furthermore, on the basis of 
the genie3 analysis, 11 of the 16 TFs have targets 
that are enriched in stress responses, and seven 
have targets that are enriched simultaneously 
in biotic and abiotic stress responses (Fig. 5B). 
Of the genes encoding these TFs, two homoeo- 


A B 


logs stood out as potential common regulatory 
components of abiotic and disease response: 
TraesCS5A401G237900 and TraesCS5B01G236400, 
which encode heat shock factor (HSF) TFs. These 
two HSF TF-encoding genes were in the top 10 
most central genes within disease module 12 
(table S34), as measured by intramodule con- 
nectivity, a value strongly correlated with the 
influence of a gene on a phenotype (41). The 
387 predicted targets of the TFs encoded by these 
two genes were frequently allocated to module 
12 of the disease network (39.5%) and module 
2 of the abiotic stress network (28.0%) (table S35). 
The Arabidopsis ortholog of these genes, TBF1, 
was originally identified for its role in pathogen 
defense response (42) and has been shown to 
play a key role in the transition from growth to 
defense (43), while also positively regulating ac- 
quired thermotolerance (44). Recently, a “TBFI 
cassette” including the promoter and 5’ leader 
region of TBFI was used to engineer broad- 
spectrum disease resistance in both Arabidopsis 
and rice without a fitness cost (45). The fact that 
Arabidopsis TBF1 is functional in rice suggests 
that this regulatory mechanism may be conserved 
across species, making the wheat orthologs iden- 
tified here promising targets for further studies. 
These and other highly connected genes (table 
$34) are strong candidates for controlling stress 
responses, and the functions of their orthologs 
support this hypothesis. These results demon- 
strate the power of the datasets and show that 
integrating gene networks from wheat, alongside 
phylogenetic relationships and knowledge of bio- 
logical function in model species, can help identify 
candidate genes for further study in wheat. 


Concluding statement 


This study provides detailed insight into the 
spatiotemporal transcriptional landscape of 
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polyploid wheat. We find evidence that the dif- 
ferences in relative expression among homoeo- 
logs observed in modern-day wheat may have 
been established both upon the polyploidiza- 
tion of wheat itself and during the subsequent 
10,000 years of polyploidy; these differences may 
have been determined through epigenetic changes 
affecting both DNA methylation and histone 
modifications. We identified asymmetries along 
wheat chromosomes for a series of features 
relating to homoeolog expression bias with impor- 
tant implications for breeding. Our work provides 
a framework for the generation of hypotheses 
about biological function in polyploid wheat, 
which can now be experimentally tested using 
recent developments in sequenced mutant pop- 
ulations (46) and genome editing approaches (47). 
Ultimately, this knowledge will help researchers 
and breeders modulate allelic variation across 
homoeologs to improve quantitative traits in poly- 
ploid wheat. This is an urgent task for achieving 
global food security, given that wheat provides 
more than 20% of the protein and caloric intake 
(48) of humans. 


Materials and methods 
RNA-seq samples 


We included 246 samples previously described 
(15) and complemented this with 283 RNA-seq 
samples which were deposited in the Short 
Reads Archive (SRA) between August 2015 and 
January 2017. An additional 321 RNA-seq samples 
from six studies were used for this analysis and 
are detailed in the supplementary materials (73). 


Mapping of RNA-seq reads to reference 


For all 850 samples, metadata was assigned as 
described in (15), with high and low-level factors 
for tissue, age, variety, and stress. Due to the rel- 
atively large number of low-level tissues (59), we 
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Fig. 5. Overlapping modules within abiotic and disease stress networks. (A) Number of genes in abiotic module 2 and disease module 12 and 
the overlap between modules. The number of transcription factors is indicated in parentheses. (B) Transcription factors found in both abiotic 2 and 


disease 12 (left) and the top five enriched GO terms of their targets, as identified by genie3 (right). 
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further defined an intermediate level of tissues 
comprising 32 factors (average 26.5; median 12 
replicates per factor) which was used for this 
study. We also assigned an intermediate level 
of stress comprising 15 factors (average 14.5; 
median 6 replicates per factor). We used kallisto 
v0.42.3 (14) to map the 850 RNA-seq samples 
to the Chinese Spring RefSeqv1.0+UTR tran- 
scriptome reference. We used default param- 
eters previously shown to result in accurate 
homoeolog-specific read mapping in polyploid 
wheat (15) (fig. S1). We summarized expression 
levels from the transcript level to the gene level 
using tximport v1.2.0. We established the crite- 
ria that at least 1% of samples for a given gene all 
required to have expression values over 0.5 TPM 
for that gene to be considered expressed (initial 
850 filter). 

To confirm that kallisto enables homoeolog 
specific mapping (75) we analyzed expression of 
HC genes expressed >0.5 TPM in nulli-tetrasomic 
wheat lines from the publicly available study 
SRP028357 (49). The nulli-tetrasomic lines were 
missing an entire chromosome (1A, 1B, or 1D) 
which was replaced by a duplication of another 
homoeologous chromosome, e.g. NullilATetra1B 
has 0 copies of 1A, 2 copies of 1B and 1 copy of 1D. 
We determined stringent homoeolog-specific 
mapping using a series of criteria detailed in 
the supplementary materials (13). 


Analyses of expressed genes 


Starting from the subset of genes considered 
expressed using the initial 850 filter criterion, 
we determined genes which were expressed in 
at least one tissue within the Azhurnaya devel- 
opmental time course (209 samples; 22 inter- 
mediate tissues) and Chinese Spring no stress 
(123 samples; 15 intermediate tissues) datasets. 
For this analysis, we first calculated the average 
TPM expression of each gene in each of the in- 
termediate tissue types (average expression per 
tissue). The number of samples that went into 
generating this average expression per tissue 
value varied for each intermediate tissue and 
are available in table S1. We considered a gene 
expressed when its average expression per tissue 
was > 0.5 TPM in at least one intermediate 
tissue. For both datasets we focused on HC gene 
models (10). Whilst expression data was also 
assessed for LC genes, we excluded these from 
the main analysis to avoid confounding effects 
from pseudogenes and low-quality gene models. 
Through this analysis we found evidence of ex- 
pression for 83,741 (75.6%) HC genes in Azhurnaya 
and 82,567 (74.5%) HC genes in Chinese Spring. 
Using the average expression per tissue values, 
we also determined the global expression of each 
gene across all tissues in which it was expressed 
(based on the >0.5 TPM criteria in the tissue). 
This generated an average value across tissues, 
rather than a geometric mean across all sam- 
ples, to account for the variation in the number 
of samples per tissue. It also excludes tissues 
in which a gene is not expressed. This average 
across expressed tissues is referred to as either 
the “global analysis” or the “combined analysis 
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(all tissues)” across the main text and in the 
supplementary materials and tables. 


Relative expression levels of the A, B, and 
D subgenome homoeologs across triads 


The analysis focused exclusively on the gene 
triads which had a 1:1:1 correspondence across 
the three homoeologous subgenomes, including 
17,400 syntenic and 1074 nonsyntenic triads 
(total of 18,474 triads or 55,422 genes). Start- 
ing from the subset of genes considered ex- 
pressed using the initial 850 filter criterion, we 
defined a triad as expressed when the sum of the 
A, B, and D subgenome homoeologs was > 0.5 TPM. 
This allowed us to include triads in which, for 
example, only a single homoeolog was expressed, 
and which could later be classified as a dominant 
triad. Using this criterion, we defined a total of 
53,259 genes (17,753 triads) which were con- 
sidered expressed (table S3). To standardize the 
relative expression of each homoeolog across the 
triad, we normalized the absolute TPM for each 
gene within the triad as follows 


TPM(A) 


expression, = TPM(A) + TPM(B) + TPM(D) 
expressiong = a 
Xp 3 ~ TPM(A) + TPM(B) + TPM(D) 
; TPM(D) 
expressiony = 


TPM(A) + TPM(B) + TPM(D) 


where A, B, and D represent the gene corre- 
sponding to the A, B, and D homoeologs in the 
triad. The normalized expression was calcu- 
lated for each one of the intermediate tissues 
and for the average across all expressed tissues 
(“combined analysis” as described previously). 
Fig. S6 shows an example of these calculations 
for the roots and the combined analysis across 
three triads. The values of the relative contri- 
butions of each subgenome per triad were used 
to plot the ternary diagrams using the R package 
ggtern (50). 


Definition of homoeolog expression 
bias categories 


The ideal normalized expression bias for the sev- 
en categories was defined as shown in table S37. 

We calculated the Euclidean distance (rdist 
function from R 3.3.2) from the observed nor- 
malized expression of each triad to each of the 
seven ideal categories listed above. We assigned 
the homoeolog expression bias category for each 
triad by selecting the shortest distance. This was 
done for each of the intermediate tissue as well 
as for the average across all expressed tissues 
(combined analysis). 


Analysis of the effects of polyploidy on 
homoeolog expression bias 


We used RNA-seq data (25) which consisted of 
two datasets based on RNA-seq samples from 
the youngest leaf at fifth leaf stage. Dataset 1 
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(SHW1) included samples from tetraploid (BBAA) 
Triticum turgidum ssp. turgidum wheat acces- 
sion AS2255, diploid Ae. tauschii (DD) accession 
AS60, and the synthetic hexaploid wheat (SHW1; 
BBAADD) resulting from the cross between the 
tetraploid and Ae. tauschii accessions. Dataset 2 
(SHW2) consisted of tetraploid T. turgidum ssp. 
durum cv Langdon (BBAA), the same diploid Ae. 
tauschti (DD) accession AS60, and an indepen- 
dent synthetic hexaploid wheat (SHW2) derived 
from Langdon x AS60 (BBAADD). Note that 
AS2255 and Langdon are both T. turgidum ssp., 
but are defined as different subspecies based 
primarily on morphological features. These ex- 
periments recreate the polyploidization events 
that gave rise to modern bread wheat and the 
resulting SHW has the same genome compo- 
sition as the CS and Azhurnaya datasets exam- 
ined in this study. 

We analyzed the RNA-seq from both data- 
sets by mapping reads to the CS RefSeqvl1.0 
transcriptome using the same bioinformatics 
pipeline as before (see “Mapping of RNA-seq 
reads to reference” section). However, for the 
tetraploid datasets we used only the A and B 
subgenome transcripts as a reference, for the dip- 
loid D genome datasets we used only D subgenome 
transcripts, and for the SHW datasets we used 
the complete RefSeqv1.0 transcriptome as the 
reference, as in CS and Azhurnaya. To generate 
the expected hexaploid wheat transcriptome 
based on progenitor species we weighted the 
TPM values from the tetraploid by 2/3 and the 
AS60 TPM values by 1/3 to maintain a total TPM 
of 10° in the combined dataset. The in-silico 
hexaploid wheat generated from the weighted 
tetraploid and diploid TPM values (referred to 
hereafter as the “expected” in-silico dataset) 
allows the direct comparison with the observed 
TPM values in SHW. We defined the seven ho- 
moeolog expression bias categories for both the 
expected in-silico and the observed SHW tran- 
scriptomes using the same methods as for CS 
and Azhurnaya and compared the classification 
of triads between the observed and expected data- 
sets (table S12). We next compared classifications 
to modern-day bread wheat CS and Azhurnaya. 
To enable a meaningful comparison across sim- 
ilar tissues from the Hao et al study (25) we used 
nine samples from the PAMP Triggered Immune 
Response dataset from CS and six samples from 
the Azhurnaya dataset (table S1). As before, we 
defined the seven homoeolog expression cat- 
egories for the defined CS and Azhurnaya data- 
sets and compared them with the SHW and the 
in-silico classifications (table S11). 


DNA methylation plant material and 
library preparation 


Plants were grown as described in the Chinese 
Spring tissues study. The frozen leaves from 
the five samples at 3-leaf stage (Zadok stage 13) 
were ground and divided as input for the prep- 
aration of both RNA-seq libraries (detailed in 
Chinese Spring tissues study) and whole genome 
bisulfite sequencing (WGBS) libraries. These 
samples enabled direct comparisons between 
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the DNA methylation profile and homoeolog 
expression patterns in the same samples. WGBS 
libraries were constructed from purified nuclei 
prepared using the published methods (57). Input 
DNA was quantified using the Qubit high sensi- 
tivity DNA kit. A total of 500 ng of nuclear DNA 
was spiked with 270 pg of Lambda DNA to as- 
sess the conversion efficiency obtained using 
the EZ DNA Methylation-GoldTM Kit (Zymo 
research corp, Irvine, Ca, USA). WGBS libra- 
ries were prepared using the TruSeq DNA kit, 
(Illumina, Madison, WI) and 2 x 125 bp paired- 
end sequence reads was generated using the 
Illumina HiSeq 2500 v4 platform (Genome 
Quebec, Montreal, QC, Canada). The data was 
deposited as SRP133674. 


DNA methylation data analysis 


Sequence quality and adaptor removal was per- 
formed using Trim_galore_v0.4.1 (52). High qual- 
ity paired-end sequence reads were aligned to 
the RefSeqvl.0 Chinese Spring genome using 
Bismark version 0.16.1 (53) ensuring the remov- 
al of duplicate reads and only retaining unique 
unambiguous alignments. The data were pro- 
cessed to exclude regions with low coverage using 
a binomial test. The methylation data was anno- 
tated using the gene feature coordinates provided 
by the RefSeqv1.0 Chinese Spring gene defini- 
tions. 5 kb flanking regions around the gene 
features were also extracted. The two flanking 
regions and the gene feature were each divided 
into 50 tiles (150 tiles in total) to summarize the 
observed methylation ratios. Data manipulation, 
statistical analysis and image generation were 
performed using the R language (54) utilizing 
the data.table (55), MethylKit v1.5.2 (56), genoma- 
tion (57), and ggplot2 packages (58). 


Comparison of RNA-seq sample 
classification with DNA methylation 


For the five RNA-seq samples (from the same 
plants used for analyzing DNA methylation) we 
classified triads into the seven balanced, dom- 
inant, and suppressed categories using the same 
method as for previous analyses. We then classified 
homoeologs within dominant and suppressed 
triads into the “dominant” and “nondominant” 
homoeologs, and “suppressed” and “nonsup- 
pressed” homoeologs. For example, in an A 
dominant triad the A subgenome is classified as 
“dominant” and B and D subgenomes are classified 
as “nondominant” homoeologs. The DNA meth- 
ylation patterns of genes in each of these cate- 
gories were plotted using the methods described 
above (DNA methylation data analysis). Dif- 
ferences in DNA methylation levels between 
categories were tested pairwise using the non- 
parametric Mann-Whitney ¢ test using the 
wilcox.test() in R (fig. $17). 


Histone modification analysis 


To study the role of histone modifications we 
carried out ChIP-seq for three active marks 
(H3K36me3, H3K9ac, and H3K4me3) and one 
repressive mark (H3K27me3) (deposited under 
SRA accession number SRP126222). We used 
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Chinese Spring at 3-leaf stage, however RNA- 
seq data were not collected from these exact 
plants. To calculate the homoeolog expression 
bias we used Chinese Spring samples from a 
separate experiment (PAMP-triggered immune 
response study) in which the same tissue was 
collected at a similar stage (3-leaf stage). Whilst 
combining data from two separate experiments 
may introduce some noise into the analysis, the 
ChIP-seq and RNA-seq data are from similar 
tissues, at a similar growth stage, in the same 
wheat variety, and are thus highly comparable. 
Nevertheless, this confounding factor should 
be considered when interpreting these results. 
ChIP assays, DNA library preparation, and se- 
quencing were performed as in (10). 


Histone data analysis 


Raw FASTQ files were preprocessed with 
Trimmomatic v0.36 (59) to remove Illumina 
sequencing adapters, trim 5’ and 3’ ends with 
quality score below 5 (Phred+33) and discard 
reads shorter than 20 bp after trimming. Paired- 
ends reads were aligned against IWGSC RefSeq 
v1.0 assembly using bowtie2 v2.3.3 with -very- 
sensitive settings (60). Alignments with MAPQ < 
10 were discarded and duplicate reads removed 
with Picard MarkDuplicates (http://broadinstitute. 
github.io/picard/). Triad expression category was 
calculated using Chinese Spring samples from a 
separate experiment (PAMP-triggered immune 
response study) using the same method as in 
previous analyses. As in the DNA methylation 
analysis we classified homoeologs within dom- 
inant and suppressed triads into the “dominant” 
and “nondominant” homoeologs, and “suppressed” 
and “nonsuppressed” homoeologs. 

We calculated meta-gene profiles for each cat- 
egory by computing the read density of each 
histone mark over different triads categories 
using Deeptools (67) computeMatrix scale-regions 
and plotted it with plotProfile. To make a sta- 
tistical comparison, for each histone mark we 
scored the number of reads overlapping with 
gene bodies using bedtools coverage —counts 
(62). Only reads fully mapping within gene 
bodies were considered. To account for differ- 
ent gene size we divided the read counts over 
each gene by its length. The distributions of 
reads density over different triads categories 
were compared with a nonparametric ¢ test 
(Mann-Whitney U-test) using the function wilcow. 
test in R (fig. S11). 


Variation in homoeolog expression 
bias across tissues (stable and 
dynamic triads) 


To define the variation in homoeolog expression 
bias of each triad across the intermediate tissues 
we calculated the Euclidean distance between 
the triad’s global position (combined analysis) 
and each individual tissue in which the triad was 
considered expressed. We included only triads 
which were considered expressed in at least six 
tissues based on the combined analysis criteria 
outlined above. The average of these distances 
was defined as the “triad mean distance”. We 
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ranked triads by their triad mean distance and 
the percentile was calculated by 


. ( rank(cmd;) ) 
percentile; = truncate 


length(CMD) 


where CMD is the vector containing all the triad 
mean distance. The first and last deciles were 
classified as stable 10% and dynamic 10% triads, 
respectively. A similar approach was used to de- 
fine the corresponding 5% and 25% extremes of 
the distribution. This analysis was conducted inde- 
pendently for the Chinese Spring no stress sam- 
ples, the Azhurnaya developmental time course, 
and for each of the four tissue-specific networks. 
A visual representation is provided in fig. S19. 


TE presence in gene promoters 


We extracted all TEs that were annotated to fall 
at least partly within 1.5 kb and 5 kb upstream 
of the canonical ATG start-codon for all genes. 
We then split the TEs into the relevant gene lists 
covering homoeolog expression bias variation 
(stable 10%, middle 80%, and dynamic 10%) and 
homoeolog expression bias (balanced, dominant, 
nondominant, suppressed, and nonsuppressed) 
based on the “combined analysis (all tissues)” for 
CS. We used these lists to identify the proportion 
of genes and triads in each category which con- 
tained at least one TE in the promoter region. 


Enrichment of TE families in 
gene promoters 


Using the GFF file of TE coordinates, we ex- 
tracted TEs present in the promoter regions of 
HC genes. We retrieved all TE copies that are 
entirely or partially present in the 5 kb upstream 
of the ATG start-codon of the canonical tran- 
script for each gene. We then calculated the 
number of genes in each of the stable 10%, mid- 
dle 80%, and dynamic 10% categories which 
contained specific TE families. We required the 
TE family to be present in at least 2% of the 
categorized genes for further analysis. We then 
found the deviation of this distribution from the 
expected 10-80-10 ratio using the x test, P values 
adjusted with Benjamini-Hochberg. We calcu- 
lated the median length of each TE family based 
on all instances of that TE across the genome. 
We found fifteen TE families deviated signif- 
icantly from the expected 10-80-10 distribu- 
tion (Benjamini-Hochberg P < 0.01). However, 
the majority of these TE families were present 
in less than 5% of the genes considered, and 
showed very small variation in the number of 
promoters containing the TE, suggesting that 
the statistical significance may not be biologi- 
cally relevant. 


TE density in gene promoters 


We calculated the density of TEs within 5 kb 
upstream of genes by calculating the propor- 
tion of TE bases in sliding windows of 100 bp 
with a step size of 10 bp. The mean of each 
window was then calculated for both the stable 
10%, middle 80%, and dynamic 10% triads and 
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the subgenome dominance categories (balanced, 
dominant, nondominant, suppressed, and non- 
suppressed). Mann Whitney tests with Benjamini- 
Hochberg adjusted P values were used to test 
for differences in TE density between catego- 
ries across each window. 


Transcription factor binding 
site identification 


The 1.5 kb of sequence upstream of the canonical 
ATG start-codon was used to identify transcrip- 
tion factor binding sites (TFBS) present in pro- 
moters of HC triads. The FIMO tool from the 
MEME suite [v 4.11.4 (63)] was used with a 
position weight matrix (PWM) obtained from 
plantPAN 2.0 (64) to predict TFBS based on 
previously identified sites across multiple plant 
species. FIMO was run with a P value threshold 
of <1E-04 (default), -motif-pseudo set to 1E-08 as 
recommended for use with PWMs and a -max- 
stored-scores of 1,000,000 to account for the 
large size of the dataset. The background model 
was generated from all extracted promoter se- 
quences using the MEME fasta-get-markov com- 
mand. Details of the TFBS comparisons between 
homoeologs is presented in the supplementary 
materials (13). 


WGCNA network construction 


Coexpression networks were built for six sepa- 
rate sample sets: grain, leaf, spike, root, abiotic 
and disease (table S1) using the WGCNA R pack- 
age (65). For each network, we selected HC genes 
which were expressed > 0.5 TPM in three or more 
samples. The count expression level of each gene 
was normalized using variance stabilizing trans- 
formation from DESeq2 (66) to eliminate differ- 
ences in sequencing depth between studies. The 
soft power threshold was calculated as the first 
power to exceed a scale-free topology fit index of 
0.9 for each network separately. The soft powers 
used were: leaf = 12, spike = 12, roots = 7, disease = 
7. For the abiotic and grain network the 0.9 
threshold was not crossed until 15 and 20 re- 
spectively, which may be due to strong differ- 
ences between samples within these datasets, 
therefore the soft power threshold was selected 
according to the number of samples, resulting 
in abiotic = 7 and grain = 6. Signed hybrid net- 
works were constructed blockwise using the 
function blockwiseModules() with a maximum 
block size of 46,000 genes. The correlation type 
used was biweight mid-correlation “bicor” and 
the maxPOutliers was set to 0.05 to eliminate 
effects of outlier samples. The topographical 
overlap matrices (TOM) were calculated by the 
blockwiseModules function using TOMType = 
“unsigned” and the minimum module size was 
set to 30. The parameter mergeCutHeight = 0.15 
was used to merge similar modules. 


Defining same, similar, and divergent 
expression patterns of triads 


For triads which had homoeologs within dif- 
ferent modules in the WGCNA networks we 
developed a threshold to determine whether 
the different modules had a “similar” or “di- 
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vergent” expression pattern. We calculated the 
Euclidean distance between module eigengenes 
using the R package dist() and with these values 
we calculate the distances between the homoeo- 
logs in each triad. Triads where the pairwise 
distances were zero were in the same module. 
Triads where the pairwise distances were over 
zero were in different modules. For these triads 
in different modules when the pairwise distance 
between any two homoeologs was > 50% of the 
median maximum distance between eigengenes, 
the triad was classified as having a “divergent” 
expression pattern. In cases where the pairwise 
distance was over zero between at least one 
pair of homoeologs and the distance between 
all three pairs of homoeologs were =< 50% of 
the median maximum distance, the triad was 
classified as having a “similar” expression pat- 
tern. The median maximum distance between 
eigengenes was averaged across all four tissue 
networks to give a final threshold (50% of me- 
dian maximum distance) of 0.937431. This anal- 
ysis was carried out for 1:1:1 syntenic and 1:1:1 
nonsyntenic triads expressed in each of the tis- 
sue networks (total triads = 9599 grain, 5378 leaf, 
11,038 root, and 6173 spike). This excluded triads 
which had a putative transposable element (67 
triads). Fig. S23 shows a graphic representation 
of this classification and the effect of altering the 
threshold in each of the four networks. 


Module overlaps 


Module overlap between networks was calcu- 
lated using the R package GeneOverlap which 
calculates significant overlaps between modules 
using a Fisher’s exact test. Modules were con- 
sidered to have significant overlaps when the 
FDR adjusted P value < 0.05. 


Correlation to stress status 


Modules within the abiotic and disease networks 
were tested for correlations to intermediate level 
stresses using the cor(function). The significant 
of correlations were calculated using the func- 
tion corPvalueStudentQ and corrected for multi- 
ple testing using p.adjust() using the Benjamini 
& Yekutieli method (67). 


Genie3 


HC genes expressed >5 TPM in at least one of 
the 850 sample were selected. Out of these 
78,085 genes there were 3386 transcription 
factors (methods described above). Random 
forest regression was estimated for each gene 
based on the transcription factors as inputs using 
the genie3 package (68) in R (version 3.3.2) with 
default parameters (K=sqrt, nb.trees=1000, 
input.idx=list of transcription factors, importance. 
measure=IncNodePurity, seed=NULL). For each 
transcription factor, all predicted target genes 
(connectivity > 0.005) were extracted and func- 
tional enrichment within the target genes was 
determined using topGO (69) in R (version 3.3.2) 
with the following parameters (ontology = “BP”, 
nodeSize = 10, classic Fisher test P < 10°). To 
summarize the results, the top three GO terms 
for each transcription factor, the P values for the 
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strongest enrichment, and the direct blastx match 
in the Arabidopsis proteome (tair10) and well as 
the e-value and description were tabulated (table 
$28). The complete list of GO term enrichments 
of the biological process ontology for each tran- 
scription factor and the list of transcription 
factors associated with each GO term in the on- 
tology of biological process are published in e! 
DAL (https://doi.ipk-gatersleben.de/DOI/53148abd- 
26a1-4.ede-802b-c2635af6a725/0dd8224a-34fc-4e3b- 
8ab8-883d07e52bd2/2/1847940088). 


Identifying highly connected hub genes 


Hub genes within each module for the abiotic and 
disease stress networks were calculated using the 
WGCNA R package function signedKME(). This 
calculates the correlation between the expression 
patterns of each gene and the module eigengene. 
Genes which were more highly correlated to the 
eigengene were considered hub genes. 
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DEVELOPMENTAL SIGNALING 


A molecular mechanism for Wnt 
ligand-specific signaling 
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Patricia Tebabi, Andra C. Dumitru, Melanie Koehler, Philipp Gut, David Alsteens, 
Didier Y. R. Stainier, Abel Garcia-Pino, Benoit Vanhollebeket 


INTRODUCTION: Wnt signaling is an ancient 
signaling pathway that has accompanied the 
emergence of metazoans and is key to many 
developmental, physiological, and disease pro- 
cesses. Similar to other signaling pathways, 
gene families for both Wnt ligand and its 
corresponding Frizzled receptor have under- 
gone extensive expansion during metazoan 
evolution. Vertebrate genomes harbor 19 closely 
related Wnt genes as well as 10 Frizzled genes. 
Gene duplication is typically considered a major 
driving force in the evolution of new biological 
functions through neo- or subfunctionalization 
of emerging paralogs. How this functional di- 
versification of Wnt ligands is structurally and 
molecularly organized, however, remains poorly 
understood. The Wnt/Frizzled molecular inter- 
action is mediated by residues conserved across 


Complex Wnt 
microenvironment 


divergent 


linker 
Wnt7 


Reck <> ( 
‘XN 


conserved 
residues 


4 


ees 


Fz perception 


both families. This promiscuous interaction is 
incompatible with monospecific recognition 
and, accordingly, when tested in pair-wise com- 
binations, multiple Wnt ligands compete for 
binding to various Frizzled receptors. 


RATIONALE: These observations raise the 
questions of how Wnt ligands achieve func- 
tional diversification and how cells interpret 
the intermingled expression patterns of simul- 
taneous and sometimes conflicting Wnt sig- 
nals. In some biological settings, cells may 
integrate all signaling inputs nondiscriminately 
and trigger appropriate responses by consider- 
ing their total net balance. However, other 
biological processes exhibit strict Wnt ligand 
selectivity, despite complex Wnt/Frizzled ex- 
pression landscapes. A prototypical example 


Gpr124/Reck 
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Task sharing for orchestrated Wnt7-specific cellular responses. (Top) Gpr124 and Reck 
cooperatively alter the cell's perception of its Wnt microenvironment by selectively potentiating 
Wnt7 signals (cyan-tinted dots). (Bottom) Reck decodes Wnt ligands by establishing 
monospecific contacts with the highly divergent Wnt7 linker domain. Gpr124 links Reck-bound 
Wnt7 to Dishevelled. Dishevelled polymers by interacting simultaneously with Gpr124 and 

Fz assemble Wnt7-enriched signalosomes that trigger signaling through Fz receptors 


and Lrp5/6 co-receptors. 
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is provided by the exclusive control of mam- 
malian forebrain and ventral spinal cord an- 
giogenesis by Wnt7a and Wnt7b. 

Within this neurovascular unit, in order to 
respond to neural progenitor-derived Wnt7 
by activating Wnt/$-catenin signaling, cerebral 
endothelial cells must express a membrane 
protein complex consisting of the adhesion 
G protein-coupled receptor (GPCR) Gpr124 
and the glycosylphosphatidylinositol-anchored 
glycoprotein Reck. This Gpr124/Reck complex 
was recently reported to promote Wnt?7-specific 
responses. 


RESULTs: Using a combination of biophysical 
approaches and ligand-binding assays in gen- 
etically engineered cells, we demonstrate that 
ligand selectivity is conferred by Reck, which 
mediates Wnt7-specific binding in a Frizzled- 


independent manner. Reck 


orchestrates Wnt ligand 


Read the full article discrimination by engaging 
at http://dx.doi. the structurally disordered 
org/10.1126/ and highly divergent linker 


science.aatl178 


domain of Wnt?7. The pres- 
ence of Gpr124 is required 
to deliver Reck-bound Wnt7 to Frizzled by as- 
sembling higher-order Reck/Gpr124/Frizzled/ 
Lrp5/6 complexes. This Gpr124 tethering func- 
tion does not rely on its GPCR structure but in- 
stead on its combined capacity to interact with 
Reck extracellularly and recruit the Dishevelled 
scaffolding protein intracellularly. By bridging 
Gpr124 and Frizzled, Dishevelled recruits Wnt7, 
via its association with Reck, into dynamic Wnt/ 
Frizzled signalosomes, resulting in increased 
local concentrations of ligand available for 
Frizzled signaling. 


CONCLUSION: Our data reveal that cells are 
equipped with “Wnt-decoding modules” that 
distinguish between Wnt ligands that are other- 
wise very similar. They also reveal a critical 
role for the linker domain in Wnt ligand evo- 
lution and functional diversification. These 
mechanistic insights into the Wnt decoding 
capacities of vertebrate cells predict that ad- 
ditional Wnt decoding modules exist, enabling 
fine-tuning of cellular behaviors in response to 
other Wnt or Frizzled family members. These 
modules expand the diversity of proximal 
events in Wnt signaling, opening new ther- 
apeutic opportunities for conditions in which 
Wnt stimulation or inhibition are desirable at 
the membrane level. In particular, the mech- 
anisms uncovered here provide an opportunity 
for the targeted treatment of human central 
nervous system neurovascular disorders. 


The list of author affiliations is available in the full article online. 
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Wnt signaling is key to many developmental, physiological, and disease processes in which 
cells seem able to discriminate between multiple Wnt ligands. This selective Wnt 
recognition or “decoding” capacity has remained enigmatic because Wnt/Frizzled 
interactions are largely incompatible with monospecific recognition. Gpr124 and Reck 
enable brain endothelial cells to selectively respond to Wnt7. We show that Reck binds with 
low micromolar affinity to the intrinsically disordered linker region of Wnt7. Availability of 
Reck-bound Wnt7 for Frizzled signaling relies on the interaction between Gpr124 and 
Dishevelled. Through polymerization, Dishevelled recruits Gpr124 and the associated 
Reck-bound Wnt7 into dynamic Wnt/Frizzled/Lrp5/6 signalosomes, resulting in increased 
local concentrations of Wnt7 available for Frizzled signaling. This work provides 
mechanistic insights into the Wnt decoding capacities of vertebrate cells and unravels 
structural determinants of the functional diversification of Wnt family members. 


nts constitute a large family of highly 
conserved and secreted proteins that 
mediate intercellular communication 
during animal development and in adult 
tissue homeostasis (J, 2). The 10 mem- 
bers of the Frizzled (Fz) family are seven-pass 
transmembrane proteins that serve as receptors 
for Wnts (3-5). Greatly contributing to the com- 
plexity of Wnt signaling, the Wnt/Fz binding 
relationships are promiscuous, with multiple 
Wnts competing for binding to individual Fzs 
and vice versa (3-12). The Wnt/Fz contacts are 
mediated by conserved residues or common 
chemical modifications (13). These observations 
raise the question of how cells can respond to 
specific Wnt ligands when exposed to the over- 
lapping expression patterns of multiple Wnt 
ligands that sometimes have opposing bio- 
logical functions. 
A pertinent example is the exclusive control 
of mammalian forebrain and ventral spinal cord 
angiogenesis by Wnt7a and Wnt7b (14-16). Spe- 
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cifically, in order to respond to neural progenitor- 
derived Wnt7 by activating Wnt/B-catenin sig- 
naling, endothelial cells must express Gpr124, 
an orphan member of the adhesion class of G 
protein-coupled receptors (GPCRs) (17-22) as 
well as the glycosylphosphatidylinositol (GPI)- 
anchored glycoprotein Reck (22, 24). Gpr124 
and Reck physically interact to synergistically 
stimulate Wnt7-specific responses (22, 24), but 
it is unknown how Wnt7 signals are specifically 
recognized and transduced. 


Reck is a Frizzled-independent 
Wnt7-specific receptor 


We first sought to determine the Wnt7 recog- 
nition mechanism, a question inherently com- 
plicated by the ubiquitous expression of Fz 
receptors and their Lrp5/6 co-receptors in ver- 
tebrate cells. We therefore generated a set of 
mutant human embryonic kidney (HEK)-293 
cell lines by targeting (i) all 10 FZ genes (FZ; '~), 
(ii) LRP5 and/or LRP6, or (iii) GPR124 and RECK, 
through multiplexed CRISPR/Cas9 mutagenesis 
(Materials and methods, Fig. 1A, and figs. S1 
to S3). Ectopically expressed V5-tagged Wnt7a 
(Wnt7a-V5) could be immunodetected at the 
plasma membrane of wild-type (WT), FZ,19", 
and LRP5~/~;LRP6~"~ cells but not GPR1247/"; 
RECK~~ cells (Fig. 1B). Ectopic restoration of Reck, 
alone or in combination with Gpr124, was suffi- 
cient to restore Wnt7a-V5 membrane labeling of 
GPR124-;RECK ~~ cells (Fig. 1C). Fz5 also bound 
Wnt7a-V5, reflecting the competence of this re- 
ceptor to mediate baseline Wnt7 signaling (11, 24). 
Control Wnt3a-V5 did not label WT or mutant 
cells (Fig. 1B and fig. $4). 

Proximity ligation assays (PLAs) allow localized 
detection of protein interactions at the single- 
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molecule level through DNA rolling-circle signal 
amplification. PLAs in GPR124/~;RECK~"~ cells 
confirmed the interaction between Wnt7a-V5 
and hemagglutinin (HA)-Reck or HA-Fz5 at the 
plasma membrane. The fraction of PLA-positive 
cells was identical (28.8 + 7.5% and 28.3 + 7.0%, 
respectively), but the intensity of the PLA sig- 
nals generated with HA-Reck was twofold higher 
than with HA-Fz5 (Fig. 1D) (quantification pro- 
tocol is provided in Materials and methods). By 
contrast, HA-Gpr124 did not generate PLA sig- 
nals. We tested all the Wnt family members and 
found that PLA interaction signals with HA-Reck 
were restricted to Wnt7a-V5 and Wnt7b-V5 (Fig. 
1E), reflecting the specificity of the Gpr124/Reck 
complex for Wnt7 signaling (20-22, 24). 

We next determined whether Reck recruits 
Wnt7a in the absence of functional Fz receptors. 
We first performed PLAs between V5-tagged 
Wnt7a or Wnt3a secreted from WT cells and 
HA-Reck exposed at the surface of neighboring 
green fluorescent protein (GFP)-labeled FZ,49 ~ 
cells. PLA signals were detected at the plasma 
membrane of 57.7 + 17.4% of the GFP* cells in 
Wnt7a* cocultures but were undetectable in 
Wnt3a* cocultures (Fig. 1F). Accordingly, the co- 
culture of FZ,;9/~ cells expressing Reck but not 
Gpri24“" could drastically reduce Wnt7a-Fz5 
signaling in neighboring Super Top Flash (STF) 
reporter cells (Fig. 1G). By contrast, none of the 
other tested Wnts, including Wnt3a (Fig. 1G), 
could be trapped by Reck-expressing cells (fig. 
85). In this ligand capture assay, Gpr124 was 
lacking most of its C-terminal intracellular do- 
main (ICD) domain (residues 81 to 337 of the 
ICD) to restrict the analysis to the extracellular 
parts of the Gpr124/Reck complex. 


Defining the Reck-Wnt7 monospecific 
recognition mechanism 


Next, we mapped the domains of Reck required 
for Wnt7a binding. We generated a collection of 
HA-tagged single-domain deletion variants of 
Reck and, after determining that all variants 
reached the plasma membrane (fig. S6), quanti- 
fied Wnt7 PLA interaction signals. This analysis 
revealed that the N-terminal cysteine-knot do- 
main (CK), and more particularly CK4 and CK5, 
were required for binding (Fig. 2A). Consistent 
with these results, Reck*“™* was also inactive in 
competition assays (Fig. 1G). 

Reck function in Wnt signaling is known to 
rely on its capacity to form a complex with Gpr124 
through its CK domain (22, 24), which we show to 
be required for Wnt7a binding. Although Gpr124: 
alone did not bind Wnt7 (Fig. 1, C, D, and G), we 
detected a fourfold increase in Wnt7a-V5/HA- 
Reck PLA signals upon coexpression of untagged 
Gpr124 (Fig. 2B). These results suggest that Reck 
is a Fz-independent Wnt receptor, whose specific 
and exclusive binding to Wnt7 is reinforced by 
the interaction of Gpr124 with its CK domain 
(Fig. 2C) (24). 

In order to better characterize the Reck/Wnt7 
interaction, we modeled the three-dimensional 
(3D) structure of Wnt7a based on Xenopus Wnt8a 
crystallographic analysis (Fig. 2D) (73). Wnt 
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Fig. 1. Reck is a Frizzled-independent Wnt7-specific receptor. 

(A) Schematics of the Wnt7 receptor complex components and strategy 
for their genetic inactivation. Cells transfected with a bicistronic 
construct encoding the sgRNA and SpCas9-2A-GFP were iteratively 
cloned by means of fluorescence-activated cell sorting (FACS) (49). 
(B) Anti-V5 immunostaining of transiently expressed Wnt7a-V5 or 
Wnt3a-V5 in cells of the indicated genotype. (C) Same as (B) for cells 
transiently coexpressing Wnt7a-V5 and the indicated receptors and 
co-receptors. (D) PLA using antibody to V5 and antibody to HA 
directed against Wnt7a-V5 and the indicated HA-tagged receptors and 
co-receptors expressed in GPR124~’-;RECK ~’~ cells. Signal quantifica- 
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Wnt3a 


Wnt7a 


tion is on the right (Materials and methods). (E) Anti-V5/HA PLAs 
between the indicated V5-tagged Wnt ligand and HA-Reck in GPR124-; 
RECK~’~ cells. (F) Anti-V5/HA PLAs in (1:1) cocultures of Wnt7a-V5- 
or Wnt3a-V5-secreting WT cells and FZ1-10 ’ HEK293T cells coex- 
pressing HA-Reck and cytosolic GFP as a transfection marker. 

(G) Ligand capture assay in triple (1:1:1) cocultures. Luciferase activities 
of HEK293-STF reporter cells transfected with Fz5 (yellow cell) and 
stimulated in a paracrine manner by cocultured Wnt7a- or Wnt3a- 
secreting cells in the presence of FZ;.190 ’ HEK293T competing cells 
transfected with various receptors and co-receptors as indicated. Scale 
bars, 10 um. ***P < 0.001; data represent mean + SD. 
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Fig. 2. Wnt7 recognition involves its intrinsically disordered linker 
region. (A) Anti-V5/HA PLAs between Wnt7a-V5 and HA-Reck or its 
variants coexpressed in GPR1I24-’-;RECK~“~ cells in the absence of 
Gpr124. Quantification is shown on the right. (B) Quantification as in (A) in 
the presence of Gpr124. (C) Reck and its Gpr124 and Wnt7 binding 
partners. (D) Structural modeling of Wnt7a based on the crystal 
coordinates of XWnt8a. The NTD and CTD are pseudocolored in cyan and 
gray, respectively. The NTD-embedded linker region is circled in purple. 
(E) Schematic representation of the V5-tagged Wnt variants examined in 
the anti-V5/HA PLAs shown and quantified in (F) in GPR124-7-;RECK “~ 
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cells coexpressing HA-Reck. (G) Gpr124/Reck—dependent luciferase activ- 
ities of 101 different single-residue variants of Wnt7a normalized to WT 
Wnt7a in HEK293-STF cells. (H) Alignment of the Wnt7 linker domain across 
the vertebrate clade. Mm, Mus musculus; Gg, Gallus gallus; Xl, Xenopus 
laevis; Dr, Danio rerio. The residues of Mm Wnt7a are color-coded according 


to their activity class determined in (G). (I) Anti-V5/HA PLAs between 


Wnt7a-V5 or Wnt7a*“-V5 and HA-Reck. Ligand secretion was evaluated by 
means of semiquantitative anti-V5 dot blot analysis of serially diluted cell 
supernatant. Ponceau S staining was used as the loading control. Scale 
bars, 10 um. *P < 0.05, ***P < 0.001; data represent mean + SD. 
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ligands adopt a two-domain structure reminis- 
cent of a human hand pinching the globular Fz 
cysteine-rich domain (CRD) (Fig. 2D, orange) 
via the palmitoylated “thumb” of their N-terminal 
domain (NTD) (Fig. 2D, cyan) and hydrophobic 
residues of their “index” C-terminal domain 
(CTD) (Fig. 2D, gray). The structures involved 
in Fz binding are pseudocolored in orange in 
Fig. 2D. The two domains are connected through 
a flexible linker region of the NTD (Fig. 2D, purple). 


We next generated a collection of Wnt7 var- 
jants deleted for specific domains or residues or 
that carried domains from other Wnt ligands 
(Fig. 2E). All analyzed Wnt7 variants could be 
detected in the cell supernatant (fig. S7). These 
ligands were each applied to PLAs, and the 
resulting data showed that Reck binding occurs 
through the Wnt7a NTD. Wnt7a""” indeed bound 
Reck, whereas Wnt7a"? did not (Fig. 2F). Chi- 
meric ligands made of Wnt7a\"” fused to the 


CTDs of Xenopus Wnt8a (XWnt8a) or mouse 
Wnt4 or Wnt16 were also positive. The palmit- 
oleic acid required for Fz binding was dispens- 
able for the interaction with Reck, as revealed 
through testing a Wnt7a°”°™ palmitoylation 
mutant. Altogether, these binding assays re- 
veal that Reck discriminates between Wnt lig- 
ands by recognizing a motif embedded in the 
Wnt7a NTD at sites distinct from those engaged 
by Fz. 
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Fig. 3. The Wnt7 linker peptide binds the Reck CK domain with low 
micromolar affinity. (A) Coomassie Blue staining of the recombinant 
Reck-CK-Fc fusion proteins used in biophysical analyses. (B) Interaction 
between Reck-CK-Fe and synthetic pWnt7 peptides by use of isothermal 
titration calorimetry (ITC). (Top) ITC of pWnt7Za, pWnt7a*“, and pWnt3a into 
Reck-CK-Fc. (Middle and bottom) ITC of Reck-CK-Fc and its CK motif variants 
to pWnt7b. (C) Probing Reck-CK-Fc/pWnt7b interaction by use of SMFS. 
Principle of force-distance curve-based atomic force microscopy (FD-based 
AFM) is shown. An AFM tip functionalized with a polyethylene glycol (PEG) 
spacer fused to the pWnt7b peptide is approached and retracted from a 
surface coated with Reck-CK-Fc. (D) Force-time curve from which the loading 
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rate (LR) can be extracted from the slope of the curve just before bond rupture 
(LR = AF/At). (E) Force-distance curves recorded at various retraction 
speeds (from 20 nm sto 20 ums) and showing no adhesion event (first 
curve) or specific adhesion events (other curves). (F) Loading rate—dependent 
interaction forces of single Wnt7b peptide-receptor bonds quantitate the 
ligand-binding energy landscape of Reck. (Inset) Fitting the data by using the 
Friddle-Noy-de Yoreo model (thin red line) (54) provides average equilibrium 
force (Feq), barrier location of the unbound state (xu), equilibrium free energy 
between the bound and unbound state (AG,,), intrinsic unbinding rate 

(k°,.), and Kg, with errors representing the S.E.. Each circle represents one 
measurement. Darker shaded areas represent 95% prediction interval. 
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Smaller NTD variants lacking the linker re- 
gion, Wnt7a!?” and Wnt7'?*’, did not bind 
Reck. This mutational analysis and the spatial 
segregation of the Wnt7a linker region from the 
Fz binding sites suggest that Reck decodes 
Wnit7a, at least in part, through this linker. The 
linker, which is highly divergent among the 
different Wnt ligands, exhibits strong evolution- 
ary conservation among the vertebrate Wnt7 
orthologs (Fig. 2H and fig. S8). This region is 
predicted to be intrinsically disordered (fig. S9), 
a feature often found in molecular recognition 
elements providing the necessary structural plas- 
ticity to accommodate multiple partners, post- 
translational modifications, or moonlighting 
functions. In many cases, intrinsically disordered 
regions also provide interactions with high spec- 
ificity and moderate-to-high affinity (25). 

Although in situ substitution of the XWnt8a 
linker by the Wnt? linker (XW8aP“™) did not 
yield detectable PLA signals, presenting the Wnt7 
linker at the free N terminus of XWnt8a (pWnt7a- 
XW68a) was sufficient to confer Reck binding 
activity to XWnt8a (Fig. 2, E and F). We hypoth- 
esize that conformational alteration accounts 
for the lack of binding of XW8aP“""™. In sup- 
port of this hypothesis, reciprocal exchange of 
Wnt linkers between various Wnts abrogates 
their activity (fig. S10). 

To precisely map the Reck interaction site, we 
analyzed Gpr124/Reck-dependent STF signaling 
of 101 single-residue variants of Wnt7a (Fig. 2G). 
The mutated residues correspond to surface- 
exposed NTD residues conserved between Wnt7a 
and Wnt7b but not found in XWnt8a or other 
Wnt ligands. Residues were mutated to alanines, 
except for endogenous alanine residues, which 
were changed to arginine. Although ~80% of the 
variants were as active as WI Wnt7a (>70% 
relative activity) (Fig. 2G, green), eight Wnt7a 
variants (Fig. 2G, red) reduced Gpr124/Reck- 
dependent signaling to <10%. All but one (137) 
critical residues clustered on the “top” or “back” 
of the predicted Wnt?7a structure, with six map- 
ping to the linker domain. All essential residues 
are strictly conserved among Wnt?7 orthologs 
from fish to mammals and absent in other Wnts, 
including Wnt3a (Fig. 2H and fig. S8). 

The linker domain of Wnt3a has been shown 
to be essential for Wnt3a activity through Lrp6 
binding (26). By analogy, the inactivity of the 
Wnt? linker variants might therefore result from 
defective binding to Lrp5/6, Reck, or both. In line 
with a function in Reck binding, Wnt7a*“, a four- 
residue variant of Wnt7a (V241A/F251A/L252A/ 
K262A) within the linker region (Fig. 2H), showed 
reduced Reck PLA signals as compared with that 
of WT Wnt7a. This lower activity occurred despite 
slightly improved secretion rates (Fig. 21). 


Biophysical characterization of the 
interaction between the Wnt7 linker 
domain and Reck 


To investigate Reck-Wnt7 binding in a cell-free 
system, the CK domain of Reck (and variants 
thereof) were fused to the Fc domain of human 
immunoglobulin G1 (IgG1). Fusion proteins were 


Eubelen et al., Science 361, eaatl178 (2018) 


purified from HEK293T cells supernatants (Fig. 3A) 
and then titrated with synthetic pWnt7a and 
pWnt?7b linker peptides by means of isothermal 
titration calorimetry (ITC). pWnt7a and pWnt7b 
bound Reck with affinity values of 7 and 1.2 uM, 
respectively (Fig. 3B and fig. S11). As controls, the 
synthetic peptides corresponding to Wnt7a*“ 
(pWnt7a**) as well as equivalent linker peptides 
of Wnt3a (pWnt3a) showed no binding to Reck- 
CK-Fc. pWnt7b binding required Reck CK4 and 
CK5 but not CK1, CK2, or CK3, mirroring the 
PLA results in cultured cells (Fig. 2A). To cor- 
roborate the results provided by the ITC analysis, 
we used single-molecule force spectroscopy (SMFS) 
to measure binding affinities at the single-molecule 
level (Fig. 3C). Binding of pWnt7b to Reck-CK- 
Fc was detectable with a measured dissociation 
constant (Kg) of 5 uM (Fig. 3, D to F). Despite the 
fundamental differences between the two tech- 
niques, ITC and SMFS thus provided a close match 
between measured binding affinity values. 

Altogether, although not excluding an addi- 
tional role of the Wnt7 linker in Lrp5/6 bind- 
ing (26), these data demonstrate that Wnt7 is 
recognized by Reck at least in part through its 
“signature” linker motif. The moderate micro- 
molar affinity values measured further suggest 
that after recognition of the linker, Reck estab- 
lishes more extensive contacts with Wnt7, in 
a process that can be potentiated by Gpr124 
(Fig. 2B). 


Gpri124 function in Wnt7 signaling does 
not depend on its GPCR structure 


Reck, by virtue of its GPI-anchoring mode, has 
limited potential to relay Wnt7 signals within 
the cell. Signal transduction therefore likely relies 
on other components of the receptor complex— 
Gpr124 and/or Fz/Lrp5/6 (Fig. 4A). 

To uncover the signal transduction mecha- 
nism, we first evaluated the functional relation- 
ship between Gpr124 and Fz/Lrp5/6 complexes 
in cultured cells. Using the Fz and Lrp mutant 
cells (Fig. 1A), we found that the function of 
Gpr124/Reck strictly relies on Fz and Lrp5/6 (fig. 
$12). We further established that their respective 
CRD and Dkk-1-sensitive Wnt ligand-binding do- 
mains are essential, implying that Wnt7 binds 
and activates Fz/Lrp5/6 in a classical manner. 
If Reck mediates Wnt7 binding and Fz/Lrp5/6 
trigger signaling, what underlies the essential 
function of Gpr124? 

In the absence of Gpr124, Reck acts cell- 
autonomously as a potent inhibitor of Wnt7/Fz5 
signaling, in a CK4-dependent manner (Fig. 4B). 
The inhibitory function of standalone Reck is 
particularly remarkable in light of the pool of 
membrane-associated Wnt7 in this Frizzled- 
positive setting (Fig. 4B). This observation implies 
that in the absence of Gpr124, Reck scavenges 
Wnt7 away from Fz5/Lrp5/6 complexes. The 
presence of Gpr124 switches Wnt7 signaling 
output from near complete inhibition to potent 
activation. 

To establish how Gpr124 mediates this “on-off” 
Wnt7 signaling switch, we turned to the zebrafish 
model. In many Wnt-controlled processes, includ- 
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ing the Wnt7/Gpr124/Reck-mediated cerebro- 
vascular functions, Wnt input levels are only 
marginally above the minimal threshold values 
required for signaling (20). It is therefore im- 
portant to investigate the signal transduction 
pathway in vivo, in response to physiological 
Wnt7 inputs. The development of the zebrafish 
brain vasculature requires Reck/Gpr124 signaling 
in a process of angiogenic sprouting that can 
readily be quantified. It therefore constitutes 
an ideal setting in which to perform structure- 
function analysis in vivo (22, 27). Using mRNA 
injections into one-cell stage gpr124-/~ embryos 
(Fig. 4C), we evaluated the activity of three Gpr124: 
variants lacking the N-terminal extracellular part 
(Gpri244=™), the seven-span moiety (Gpri244™"), 
or the C-terminal cytoplasmic extension (Gpri24“™™, 
lacking residues 29 to 337 of the ICD) (Fig. 4D). 
Although ectopic expression of Gpri244"& or 
Gpri24 did not restore brain angiogenesis 
in gpri24 mutants or morphants, Gpr1244™?7 
was sufficient to trigger brain angiogenesis in vivo 
(Fig. 4, E and F, and fig. $13) and Wnt/B-catenin 
activity in vitro (fig. S14). 

This retained competence of Gpr124“7™” was 
unexpected: Gpr124 is a GPCR, a receptor super- 
family classically relaying extracellular stimuli 
through ligand-induced conformational remodel- 
ing of their seven transmembrane spans, which 
are absent in the engineered Gpr124“'™””. These 
data raise the possibility that Gpr124 does not act 
as a “classical” GPCR when promoting Wnt7 
signaling. 

To test this hypothesis, we developed a bi- 
molecular complementation assay in which the 
GPR124 extracellular domain (ECD) and ICD 
are linked by a surrogate anti-GFP VhhGFP4 
nanobody-GFP connector (Fig. 4G). Nanobodies 
are single-domain antibody fragments that have 
been used to rout or misrout intracellular pro- 
teins (28). We repurposed them here as condi- 
tional tethers for signal transduction analysis. 
The highly flexible GFP-VhhGFP4 connector 
acts a buffering module, ensuring that conforma- 
tional information cannot be exchanged between 
tethered partners. On the basis of this idea, we 
designed Gpr124 “!©?-GFP and Gprl25 “““?-GFP 
fusions to which VhhGFP4 fusions will be re- 
cruited (Fig. 4G and fig. S15). The gpr124 vascular 
phenotypes were partially suppressed by co- 
injecting mRNAs encoding Gpr124“"°?-GFP 
and VhhGFP4-ICD (residues 29 to 337) (Fig. 4H). 
Moreover, a chimeric Gpr124 ECD linked to 
cytoplasmic GFP via the transmembrane span 
of the unrelated CD27 receptor was similarly 
active with VhhGFP4-ICD (Fig. 4, G and H). 
We used Gpr125, a closely related aGPCR de- 
void of angiogenic activity, as well as VhahGFP4- 
red fluorescent protein (RFP) as negative controls 
(20-22). 

These results confirm that Gpr124 function 
in Wnt7 signaling does not require signal trans- 
duction across the membrane through confor- 
mational remodeling. Instead, Gpr124 seemingly 
acts in this module as a signaling-deficient trans- 
membrane protein whose activity relies on its 
Reck-binding ECD (22, 24) and its conformationally 
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uncoupled ICD. The function of this latter do- 
main remains to be defined (Fig. 41). 


Gpri124 activity requires 
Dishevelled binding 


We hypothesized that the Gpr124 ICD might op- 
erate through Dishevelled (Dvl), the necessary 


effector of Wnt signaling that interacts with Fz. 
This “Dvl hypothesis” is rooted in the findings 
that Gpr125 physically interacts with Dvl via its 
C-terminal ICD domain (29) and that Gpr124/ 
125 hybrids in which the ICD of Gpr124 is re- 
placed with the ICD of Gpr125 are able to promote 
brain angiogenesis (Fig. 5A) (22). Gpr124/Fz2 hy- 


brids (Gpri24’?) harboring the Fz2 ICD, 
which is known to bind Dvl, were similarly ac- 
tive. By contrast, full-length Fz2 was not. The 
activity of Gpri24'0?'? was dependent on its 
KTxxxW and ETTV Dvl binding motifs (Fig. 5A). 

Moreover, in a noncanonical Wnt signal- 
ing context, overexpression of Gpr125 impairs 
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Fig. 4. Gpr124 function in Wnt signaling does not depend on its GPCR 
structure. (A) Potential Wnt7-specific signal transduction mechanisms. 
(B) Luciferase activity (left) and anti-V5 immunostaining (right) of HEK293- 
STF cells cotransfected with Fz5, Wnt7a, and the indicated receptors and 
co-receptors. (©) Experimental setup for in vivo brain angiogenesis assays 
in zebrafish embryos after RNA injections at the one-cell stage. The red area 
is imaged to analyze the hindbrain central arteries (CtAs). (D) Gpr124 and 
its domain variants. (E) Representative 3D wire diagrams of the cerebro- 
vasculature of WT and gpr124 mutant embryos at 60 hours post fertilization, 
injected with 100 pg of the indicated RNA. Red vessels represent the Gpr124/ 
Reck—dependent intracerebral CtAs that sprout from the gray perineural 
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primordial hindbrain channels (PHBCs); the confocal stacks used to generate 
the wire diagrams are available in fig. S13. (F) Hindbrain CtAs of 
gpr124 morphant embryos at 60 hours post fertilization, injected at the 
one-cell stage with 100 pg of the indicated mRNA. (G) Bimolecular GPCR 
complementation assay strategy. Membrane receptors (Gpr124, Gprl25, and 
Gprl24®°°-TM°"’) inactivated by ICD/GFP substitutions are coexpressed 
with VhhGFP4 fusions to reconstitute a bimolecular adduct through an 
artificial GFP-VhhGFP4 linker. (H) Hindbrain CtAs of gpr124 morphant 
embryos at 60 hours post fertilization, injected at the one-cell stage with 

100 pg of the indicated mRNAs. (I) Gpr124 and its essential protein interactions. 
Scale bar, 10 um. **P < 0.01, ***P < 0.001; data represent mean + SD. 
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convergence and extension movements during 
zebrafish gastrulation, resulting in a mediolateral 
broadening of the anterior-posterior embryonic 
axis as well as synophthalmia or cyclopia (29). 
These phenotypes were linked to the capacity 


of Gpr125 to modulate Dvl distribution through 
its ICD. Gpr124 strictly mimicked Gpr125 in 
these settings (Fig. 5B and fig. S16). 

We therefore tested whether Gpr124 affects 
the intracellular distribution of Dvl. We used 


two distinct cell populations of the zebrafish 
blastula to address this question: the superficial 
enveloping layer (EVL) cells, which maintain 
continuous intercellular contacts, and the deep 
layer (DEL) cells, which establish more discrete 
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Fig. 5. Gpr124 interacts with Dvl through its ICD. (A) Hindbrain CtAs of 
gpr124 morphant embryos at 60 hours post fertilization, injected at the one- 
cell stage with 100 pg mRNA encoding the illustrated receptors. (B) WISH of 
myod1 expression at the seven-somite stage embryos, control, or injected 
with 250 pg of gpr125 or gpr124 mRNA. The red lines indicate the 
mediolateral broadening of the anterior-posterior axis. (C) Intracellular 
distribution of Gpr124-tagRFP and DvI-GFP expressed individually in zebra- 
fish blastulae. (D) Intracellular distribution of Gpr124-tagRFP and Dvl-GFP 
coexpressed in zebrafish blastulae. The cells annotated with an asterisk are 
magnified at right, and the pixel intensity of the green and red channels 
along a virtual clockwise path following the cell cortex from a to b is plotted 
below. In (C) and (D), open and solid arrowheads point respectively to 
Gpr124-negative and Gpr124-positive membrane microdomains in EVL and 
DEL cells. (E) Gpr124 and its ICD variants. (F) Luciferase dose-response 
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values of HEK293-STF cells cotransfected with Wnt7a, Fz1, Reck, and 
increasing amounts of Gpr124 constructs or its ICD variants as illustrated in 
(E). (G) Hindbrain CtAs of gpr124 morphant embryos at 60 hours post 
fertilization, injected at the one-cell stage with 100 pg of the indicated 
mRNA. (H) Intracellular distribution of DvIl-GFP coexpressed with Gpr124 
and its ICD variants in zebrafish blastula EVL cells. (1) same as (H) in 
FZ110 ’~ cells. In (H) and (I), red arrowheads point to DvI-GFP signal at the 
plasma membrane. (J) Anti-FLAG coimmunoprecipitation assays in total 
lysates of cells coexpressing Dvl-GFP and N-terminal FLAG-tagged versions 
of Gpr124 or its ICD variants. (K) Mosaic gene disruption in somatic 
zebrafish embryos by means of injection at the one-cell stage of 150 pg of 
zCas9 mRNA and 50 pg of the illustrated sgRNAs. Resulting hindbrain 
vasculature scored at 48 hours post fertilization. BA, Basilar artery. Scale 
bars, 10 um. **P < 0.01, ***P < 0.001; data represent mean + SD. 
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intercellular junctions (Fig. 5C). Gpr124-tagRFP 
distribution reflects this differing junctional 
organization, with near-uniform membrane signals 
in EVL and discontinuous signals in DEL cells 
(Fig. 5C). When expressed individually, Xenopus 
Dvl-GFP mainly formed cytoplasmic punctae in 
both cell populations, as reported previously (29). 
Coexpressing Gpr124-tagRFP largely redistributed 
Dvl-GFP to the Gpr124-positive membrane sub- 
domains in both EVL and DEL cells, where the 
proteins colocalized (Fig. 5D). 

We generated a collection of Gpr124 trunca- 
tion variants with a range of deletions within 
the 337-amino-acid-long ICD (Fig. 5E) and eval- 
uated their Wnt7/B-catenin functions in vitro 
(Fig. 5F) and in vivo (Fig. 5G). The Gpr124 and 
Gpr125 ICDs contain no obvious motifs except 
for the last four ETTV amino acids that con- 
stitute a canonical PDZ-binding motif (PBM), 
which is also found in a subset of Fz receptors. 
The ETTV tetrapeptide contributed, but was not 
essential, for Gpr124 activity. It was similarly 
dispensable for high-dose Gpr124- or Gpr125- 
induced planar cell polarity phenotypes (fig. S16) 
(29). Analysis of increasingly larger C-terminal 
deletion variants mapped the essential region 
of the Gpr124 ICD to the interval spanning res- 
idues 80 to 213. The activity of the different ICD 
variants exactly matched their capacity to re- 
cruit DvI-GFP in EVL cells (Fig. 5H) and FZ;49 
cells (Fig. 51). This interaction between Gpr124 
and Dvl could also be detected with coimmuno- 
precipitation. In these assays, only Gpr124: var- 
iants harboring the 81-213 region interacted with 
exogenous Dvl-GFP (Fig. 5J) or endogenous DVL2 
(fig. S17) in FZ,;9 /~ cells. This interaction is likely 
direct because purified recombinant GST-Gpr124- 
ICD or GST-Gpri24-ICD“=""" fusion proteins were 
able to pull down in vitro-translated Dvl-GFP 
(fig. S18), as previously reported for Gpr125 (29). 

To test for the endogenous requirement for 
Gpr124 ICD interaction with Dvl, we performed 
gene-disruption experiments in somatic zebra- 
fish embryos by coinjecting zCas9 mRNA and 
single-guide RNAs (sgRNAs) targeting the Gpr124 
coding sequence immediately upstream, within, 
or downstream of the Dvl-binding region (ICD 
residues 80 to 213). Although the injection of 
four out of five sgRNAs predicted to disrupt Dvl 
recruitment to Gpr124 generated embryos that 
lack hindbrain CtAs with a penetrance ranging 
from 13.6 to 70.8%, none of the seven sgRNAs 
targeting Gpr124 downstream of the Dvl-binding 
region generated penetrant brain vascular de- 
fects (Fig. 5K). 

Taken together, these experiments identify 
Dvl as a Gpr124 binding partner that could 
mediate its Wnt7 signaling activities at the plasma 
membrane. Unlike Fz, Gpr124-mediated recruit- 
ment of Dvl at the plasma membrane did not 
yield a detectable increase in phosphorylated Dvl 
levels, an early indicator of Wnt signaling activa- 
tion upstream of B-catenin stabilization (fig. S19) 
(30, 31). This absence of Gpr124-induced Dvl ac- 
tivation is consistent with the experiments shown 
in Fig. 4, D to I, that demonstrated Gpr124 to be a 
conformationally inert Wnt7 signaling mediator. 
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DvI polymers assemble ligand-specific 
Wnt signalosomes by linking Gpr124 

and Fz 

Knockdown of DVL2 by means of small interfer- 
ing RNA (siRNA) impairs Gpr124,/Reck-mediated 
signaling (fig. S20). Because Dvl is an essential 
adaptor of Fz, and Gpr124/Reck signaling relies 
on Fz (fig. S12), such cell-wide loss-of-function 
approaches are however of limited value to probe 
Dvl function specifically as a Gpr124 (and not Fz) 
effector. We therefore used the nanobody strat- 
egy described in Fig. 4G to selectively modulate 
Dvl binding to Gpr124. VhhGFP4-mediated re- 
cruitment of Dvl to Gpr124“!©?-GFP, but not to 
Gprl25“"?-GFP, was sufficient to partially reverse 
the gpr124 mutant vascular phenotype in vivo 
(Fig. 6A). As additional control, injecting either 
component alone or substituting VhahGFP4-Dvl 
with VhhGFP4-RFP did not rescue brain angi- 
ogenesis. These experiments reveal that Dvl is 
sufficient to mediate Gpr124 intracellular func- 
tions in Wnt7 signaling. 

Gpr124, Reck, and Fz/Lrp5/6 have been re- 
ported to form higher-order receptor complexes 
(24). We reasoned that the Gpr124 ICD might 
assemble this complex via Dvl. Dvl molecules 
indeed assemble signalosomes through dynamic 
polymerization (32-34). Because Dvl physically 
interacts with both Gpr124 and Fz, Gpr124 and 
the associated Reck-bound Wnt7 might thus be- 
come trapped in dynamic Wnt signalosomes, 
increasing the local concentration of Wnt7 lig- 
ands available for Fz signaling. 

Wnt signalosomes are readily detected with 
light microscopy as large, punctate structures 
enriched in Dv] that form at or below the plasma 
membrane (33-35). To determine whether Fz 
and Gpr124 codistribute in Wnt signalosomes 
in a Dvl-dependent manner, we first examined 
the localization of individually expressed Fz-GFP 
and Gpr124-tagRFP in DEL cells. Fz4 decorated 
the entire plasma membrane periphery, whereas 
Gpr124-tagRFP accumulated at cellular contacts 
(Figs. 5, C and D, and 6B). This differential mem- 
brane localization was retained upon Dvl expres- 
sion (Fig. 6B). Consistent with their Dvl binding 
capacity, both receptors recruited Dvl-GFP from 
the cytoplasm (Fig. 5C) to their respective mem- 
brane compartments (Fig. 6C). However, when 
Gpr124-tagRFP and Fz4-GFP were coexpressed, 
Fz4-GFP quantitatively relocalized, in a Dvl- 
dependent manner, to the Gpr124-positive in- 
tercellular junctions (Fig. 6D). Gpr124-tagRFP 
and Fz4-GFP colocalized in Wnt signalosome- 
reminiscent punctate structures that were par- 
ticularly evident at EVL cell membranes (Fig. 6E 
and fig. S21). 

We used bimolecular fluorescence comple- 
mentation as an additional assay to test for 
Dvl-dependent Fz/Gpr124 interaction in DEL 
cells. Coinjection of Gpr124-VN,;5 (1152L) and 
Fz1-VC,;5 indeed generated bright junctional 
signals in a Dvl-dependent manner (Fig. 6F), 
demonstrating that Fz and Gpr124 indirectly 
interact via the Dvl scaffold protein. Altogether, 
these data provide a molecular mechanism for 
spatial enrichment of Wnt7 within Fz/Lrp5/6 sig- 
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nalosomes, permitting potentiated and ligand- 
selective cellular responses (Fig. 6G). 


Discussion 


This work provides mechanistic insights onto 
the Wnt decoding capacities of vertebrate cells. 
It also demonstrates that the evolutionarily con- 
strained Wnt structure retained enough diver- 
sity to allow ligand-specific cellular responses, a 
property so far thought to require structurally 
unrelated Frizzled ligands such as Norrin (36). 

These structural insights into Wnt evolution 
and function suggest that additional Wnt decod- 
ing modules exist, enabling fine-tuning of cellular 
behaviors in response to other Wnt or Fz family 
members. The discrete interaction mode of Fz 
and Wnt leaves large surfaces made of evolution- 
ary conserved residues available to accommodate 
additional co-receptors. We therefore propose that 
Wnt decoding modules might have contributed to 
shaping the evolution of the Wnt ligand family. 

The benefit to promiscuous Wnt/Fz inter- 
actions with specificity conferred by accessory 
proteins rather than monospecific Wnt/Fz inter- 
actions might lie in the increased modularity of- 
fered by the binary system. A single-component 
system would be limited to an on-or-off signaling 
output. The two-component system described 
here can, context-dependently, achieve cell- 
autonomous Wnt signaling inhibition or act 
as a tunable rheostat amplifying the signaling 
output of specific Wnt ligands. 

A salient molecular property of the Wnt7 mod- 
ule is the use of Dvl as a common Gpr124 and Fz 
adaptor. It is tempting to propose that taking 
advantage of Fz-associated scaffold proteins such as 
Dvl could constitute a generic mechanism for Wnt/ 
Fz modifiers. Accordingly, cells recurrently tailor 
their responses to Wnt by reshaping the molecular 
composition of the Dvl-associated proteins, includ- 
ing regulatory kinases, E3 ubiquitin ligases, and 
components of the endocytic machinery (32-34). 

Our findings have clinical implications. The 
pleiotropic functions of Wnt signaling in health 
and disease make this pathway a conspicuous 
yet intrinsically challenging therapeutic target. 
Manipulating the Wnt/B-catenin pathway at the 
level of its cytosolic or nuclear components har- 
bors high potential for systemic effects with un- 
desirable outcomes across a range of tissues 
(37). Interventions focused on specific Wnts, Fz 
receptors, or other signaling components at the 
cell membrane might in principle be more se- 
lective and hence better positioned to lead to 
clinically viable strategies (4, 38-40). The mech- 
anism uncovered here expands the prospects for 
specific Wnt-targeted interventions. In particu- 
lar, by giving molecular insights at the heart of 
Wnt7-specific signaling, it provides an opportu- 
nity for the targeted treatment of human brain 
disorders with neurovascular involvement, in- 
cluding stroke and brain cancer (41). 


Materials and methods 
Zebrafish lines 


Zebrafish (Danio rerio) were maintained at 
28°C on a 14 hours light/10 hours dark cycle. 
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Fig. 6. Dvl polymers assemble ligand-specific Wnt signalosomes by 
linking Fz and Gpri24. (A) Hindbrain CtAs of WT or gpr124 morphant 
embryos at 60 hours post fertilization, injected at the one-cell stage 
with 50 pg of the indicated mRNA. The injected doses of mRNA were 
reduced compared with Fig. 4H so as to avoid the developmental 
toxicity of VhhGFP4-Dv! mRNA injections. Representative hindbrain 
vasculature is shown on the right. (B) Intracellular distribution of 
Fz4-GFP and Gpr124-tagRFP expressed individually or in the presence of 
DvI in zebrafish blastula DEL cells. (C) Intracellular distribution of 
DvI-GFP coexpressed with Fz4 or Gpr124 in zebrafish blastula DEL cells. 
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Signaling-deficient module 


Potentiated and ligand selective 
Wnt/B-catenin signaling 


(D) Intracellular distribution of coexpressed Fz4-GFP and Gpr124- 
tagRFP in the absence or presence of Dvl in zebrafish blastula DEL cells. 
The cells annotated with an asterisk are magnified at right, and the pixel 
intensity of the green and red channels along a virtual clockwise path 
following the cell cortex from a to b is plotted below. (E) Same as (D) in 
EVL cells. (F) BiFC signals in zebrafish DEL cells expressing Gpr124- 
VN155 (1152L) and Fz1-VC155 in the presence or absence of Dvl 
overexpression. (G) Integrated model for Gpr124/Reck-dependent, 
Wnt7-specific Fz signaling. Scale bars, 10 um. ***P < 0.001; data 
represent mean + SD. 
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Embryos were obtained and raised under stan- 
dard conditions in accordance with European 
and national ethical and animal welfare guide- 
lines (protocol approval number: CEBEA-IBMM- 
2017-22:65). Staging was performed according to 
Kimmel et al. (42). The transgenic and mutant 
lines used in this study are 7g(kdrl:EGFP)***? 
(43), Tg(kdrl: HRAS-mCherry)*°°> (44) and 
epr12459%* (22). 


Morpholinos, RNA constructs, 
and microinjection 


Splice-blocking morpholinos targeting gpr124 
(ACTGATATTGATTTAACTCACCACA) (22) were 
purchased from Gene Tools and injected at the 
one-cell stage at 2 ng. Synthetic mRNAs were 
transcribed from NotI linearized pCS2 using the 
mMessage mMachine SP6 Kit (Thermo Fisher 
Scientific) and injected into one-cell stage zebra- 
fish embryos. 


Somatic gene disruption in zebrafish by 
CRISPR/Cas9 


The gpri24-targeting se¢RNAs constructs (sgRNA 
1-12) were obtained by in vitro annealing of the 
following primers (1-Fo: TAGGTGGCATGCTA- 
CAAAAAGC and 1-Rv: AAACGCTTTTTGTAG- 
CATGCCA; 2-Fo: TAGGTTTGGGTCCGTAG- 
GTGGA and 2-Rv: AAACTCCACCTACGGACC- 
CAAA; 3-Fo: TAGGGTAATGGTTTGGGTCCGT 
and 3-Rv: AAACACGGACCCAAACCATTAC; 4- 
Fo: TAGGAGAAGAAGAGTAATGGTT and 4-Rv: 
AAACAACCATTACTCTTCTTCT; 5-Fo: TAG- 
GAAGCTTGTAGTAGAACCA and 5-Rv: AAAC- 
TGGTTCTACTACAAGCTT; 6-Fo: TAGGCCA- 
TTCTGGCCCTCTGAA and 6-Rv: AAACTTCA- 
GAGGGCCAGAATGG; 7-Fo: TAGGCCAACG- 
GGCGGCACAGAG and 7-Rv: AAACCTCTGT- 
GCCGCCCGTTGG; 8-Fo: TAGGGGCACAGA- 
GGGGAAGGAC and 8-Rv: AAACGTCCTTCC- 
CCTCTGTGCC; 9-Fo: TAGGTAGTGATGGAG- 
GTAGCAG and 9-Rv: AAACCTGCTACCTC- 
CATCACTA; 10-Fo: TAGGTGCCTTCGGTAG- 
CACGTA and 10-Rv: AAACTACGTGCTACCG- 
AAGGCA; 11-Fo: TAGGGGTATGAAGAGCAG- 
GGTA and 11-Rv: AAACTACCCTGCTCTTCA- 
TACC; 12-Fo: TAGGTCGGTGGGTATGAAGAGC 
and 12-Rv: AAACGCTCTTCATACCCACCGA) and 
cloning into pT7-gRNA (Addgene #46759), as 
described in (45). sgRNA were transcribed from 
the BamHI linearized vector pT7-gRNA using 
the MEGAshortscript T7 kit (Thermo Fisher 
Scientific). The synthetic Cas9 mRNA was tran- 
scribed from the Xbal linearized vector pT3TS- 
nls-zCas9-nls (Addgene #46757) using the mMessage 
mMachine T3 Kit (Ambion). sgRNAs 1-12 (50 pg) 
and nls-zCas9-nls mRNA (150 pg) were in- 
jected into one-cell stage zebrafish embryos. 
The brain vasculature was analyzed and imaged 
at 48 hpf. 


Whole-mount in situ hybridization 
(WISH) 


Digoxigenin-labeled antisense riboprobes were 
synthesized with the digoxigenin (DIG) RNA 
labeling kit (Roche Diagnostics GmbH). Em- 
bryos were fixed in 4% paraformaldehyde 
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overnight at 4°C and whole-mount in situ hy- 
bridization was performed using digoxigenin- 
labeled myodI riboprobes as previously described 
(46, 47). 


Expression plasmid constructs 


Wnt, Frizzled, Reck and Gpr124 and their var- 
iants used in STF and PLA assays were expressed 
from the CMV promoter of the pCS2 plasmid 
except for the following plasmids: pRK5-Lrp5- 
rhotag, Fz4-GFP (Addgene #42197), Fzl (Addgene 
#42253) and Fz5 (Addgene #42267), all kindly 
provided by J. Nathans. Single-point mutation 
variants, deletions and chimeras were generated 
using In-Fusion cloning (ST0345, Takara) and 
tandem overlapping PCR products. All constructs 
were confirmed by Sanger sequencing. Through- 
out this study, we used Xenopus Dvl-GFP (Addgene 
#16788, from the R. Moon laboratory), zebrafish 
Gpr124 and Gpr125 as well as mouse Reck and 
untagged Wnt ligands (22). The collection of ac- 
tive human Wnt-V5 ligands was kindly provided 
by the Xi He laboratory via Addgene (#43807 to 
#43825) (48). The activity of the V5-tagged lig- 
ands was verified by STF assays (fig. S22). Reck 
deletion variants correspond to the following 
amino acids of mouse Reck (NP_057887.2): 
Reck*™'; 46-93; Reck““: 113-150; Reck““™: 
160-206; Reck*°“*; 295-272; Reck*“: 301-347; 
Reck**"*; 46-347; Reck“°®?: 352-484 and 
Reck**““4L, 636-798. Gpr124 deletion variants 
correspond to the following amino acids of 
zebrafish Gpr124 (NP_001305000.1): Gpri244"°: 
68-725 and Gprl244™?, 764-1030. The Gprl24 
ICD used in deletion, swapping or fusion con- 
structs corresponds to residues 1058-1367 of the 
full length protein (28-337 of the ICD), unless 
otherwise indicated. The ICD of Gpr124 is 
defined as the segment that follows the last 
transmembrane segment of the seven pass trans- 
membrane protein as non-ambiguously defined 
by the InterPro server of EMBL-EBI (based on 
the Phobius transmembrane topology and sig- 
nal peptide predictor from the Stockholm Bio- 
informatics Centre). The ICD of zebrafish Gpr125 
(NP_001289153) and of zebrafish Fz2 (NP_571215.1) 
were similarly defined and correspond to the 
residues 1054-1346 and 526-550, respectively. 
Gpr124"°?-TM©>?’-GFP was generated by re- 
placing the 7-TM span of Gpr124 (residues 
743-1030) by the TM of mouse CD27 (NP_ 
001028298.1) (IFVTFSSMFLIFVLGAIL). The 
VhhGFP4-ICD construct was generated by 
fusing the Gpr124 ICD to the C terminus of 
VbhGFP4 [kindly provided by Markus Affolter 
(28)]. Gprig4iCPF4KTeaW/AETIV was generated 
by deleting the KTxxxW (KTLHSW) and ETTV 
motifs within the ICD of zebrafish Fz2. The 
BiFC constructs were subcloned from pBiFC- 
VC155 (Addgene #22011) and pBiFC-VN155 (1152L) 
(Addgene #27097). The HA tag was inserted after 
residue 22 in murine Reck, residue 47 in zebrafish 
Gpr124 and residue 22 in murine Fz5. The FLAG 
tag was inserted after residue 47 in zebrafish 
Gpr124. We thank the numerous colleagues that 
facilitated this work by depositing their plas- 
mids at Addgene. 
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Cell culture and HEK293(T) mutant 

cell lines 

HEK293T cells were obtained from ATCC (CRL- 
3216) and the HEK293 STF cell line was kindly 
provided by J. Nathans. WT and mutant cells 
were cultured in DMEM/F12 medium (Lonza) 
supplemented with 10% fetal bovine serum and 
maintained at 37°C in a humidified incubator 
equilibrated with 5% CO.. GPR124 and RECK 
were genetically inactivated using CRISPR/Cas9 
approaches in HEK293-STF cells and LRP5, 
LRP6 and FZ were inactivated in HEK293T 
cells. The mutant cell lines were obtained by 
the iterative CRISPR/Cas9-mediated mutagen- 
esis strategy illustrated in Fig. 1A. CRISPR/ 
Cas9 guide sequences were designed using the 
http://crispr.mit.edu website and were cloned 
into pSpCas9(BB)-2A-GFP (49). The top 1% of 
GFP* cells was isolated by FACS (ArialII, BD 
Biosciences) 48 hours after transfection and 
distributed in 96-well plates for clonal expan- 
sion. In order to facilitate the genetic character- 
ization of the mutant cell lines, clones generating 
multiple-peak derivative melt curves in high- 
resolution melt analyses were first counter- 
selected. For each gene, two independent PCR 
products flanking the target site (300-1000 bp 
PCR products centered on the PAM site) were 
gel purified and characterized by Sanger se- 
quencing on both strands. The absence of WT 
alleles in the uncloned PCR product was verified 
and the number of different edited alleles (1, 2 or 
3 in hypotriploid HEK293 cells) was deduced 
from the complexity of the chromatograms (fig. 
Sl). For each gene, the two independent PCR 
products were then cloned in pCR’™-Blunt II- 
TOPO” (Thermo Fisher Scientific) and individual 
colonies were sequenced until each edited allele 
was sequenced at least twice from each individ- 
ual PCR amplicon. The number of analyzed 
colonies for the FZz39! ~ cell line is provided in 
table S1. The sequencing chromatograms form 
the identified individual clones were then aligned 
to ensure that the allelic complexity of the un- 
cloned PCR can exhaustively be accounted for by 
the identified alleles (fig. S1). Finally, to exclude 
allelic exclusion by PCR amplification bias, the 
absence of WT alleles in the FZ;.;9 /~ cell line was 
verified by whole exome sequencing. 


STF dual luciferase assay 


Cells were plated into 96-well plates and trans- 
fected after 24 hours in triplicate with Lipofect- 
amine 2000 (Thermo Fisher Scientific). The 
amount of plasmid DNA transfected per well 
was optimized for each expression vector as 
follows: Renilla luciferase (0.5 ng), Wnt ligands 
(20 ng), Fz receptors (5 ng), Lrp5 (2.5 ng), Gpr124 
(10 ng), Reck (5 ng) and Dkk-1 (10 ng) unless 
otherwise indicated. The total amount of DNA 
was adjusted to 100 ng per well with the empty 
pCS2 vector. Dual luciferase assays were performed 
using the STF cell line or by co-transfecting 20 ng 
of M50 Super 8x TOPFlash plasmid (Addgene 
#12456). Cells were harvested in passive lysis buffer 
(E1980, Promega) and the activities of the Firefly 
and Renilla luciferases were measured sequentially 
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using the Dual-Luciferase Reporter Assay system 
(E1980, Promega) 48 hours post transfection. 
The competition assays in Fig. 1G and fig. S5 
were performed by plating cells as a 1:1:1 mix- 
ture at 90% confluency in 96-well plates 24 hours 
after transfection. Luciferase activity was mea- 
sured 24 hours after co-culture. For siRNA, cells 
were plated into 96-well plates and transfected 
after 24 hours in triplicate using Lipofectamine 
2000 (Thermo Fisher Scientific) with 1 pmol (per 
well) of Dvl2 or control siRNAs (SASI_HSO1_ 
00104204 and SICOO1, Sigma-Aldrich) together 
with the indicated plasmids. 


Immunofluorescence and proximity 
ligation assay 


Cells were grown in glass-coated chambers 
(IBIDI) and transfected after 24 hours with 
Lipofectamine 2000 (Thermo Fisher Scientific). 
Cells were fixed with 4% paraformaldehyde for 
10 min at room temperature (RT) 48 hours post 
transfection. For immunofluorescence staining 
(IF), cells were blocked in 1% BSA-PBS for 30 min 
before being exposed to primary antibodies for 
1 hour at RT. After three PBS washes, cells were 
incubated with secondary antibodies for 1 hour 
at RT. For anti-V5 staining of Wnt ligands, cells 
were additionally washed for 10 min in PBS 0.1% 
Tween 20 before incubation with the secondary 
antibody solution. For the proximity ligation as- 
say (PLA) (Sigma-Aldrich), cells were blocked for 
30 min at 37°C with the Blocking solution pro- 
vided by the manufacturer before being incu- 
bated with the primary antibodies for 1 hour at 
RT. Cells were washed three times and incubated 
with the PLA probes anti-rabbit PLUS and anti- 
mouse MINUS for 1 hour at 37°C. After two PBS 
washes, cells were incubated with the Duolink 
Ligation solution for 30 min at 37°C. After two 
PBS washes, cells were incubated with the Duolink 
Amplification solution for 100 min at 37°C. The 
following antibodies were used: monoclonal 
mouse anti-V5 (R96025, Thermo Fisher Scien- 
tific) at 1:500 for IF and PLA, purified polyclonal 
rabbit anti-HA (H6908, Sigma-Aldrich) at 1:400 
for IF and PLA and anti-mouse Alexa488- 
conjugated secondary antibody (Thermo Fisher 
Scientific) at 1:5000. Cells were stained for 2 min 
with Hoechst diluted to 10 pg ml in PBS. 


Western blot, Dot blot, GST pull-down 
and coimmunoprecipitation 


The following antibodies were used with an 
overnight incubation at 4°C: rabbit anti-DVL2 
(1:1000, #32245, Cell Signaling Technology), rab- 
bit anti-DVL2-phospho T224 (1:1000, ab124941, 
Abcam), mouse monoclonal anti-V5 (1:1000, R96025, 
Thermo Fisher Scientific), mouse monoclonal anti- 
B-actin-peroxidase (1:50000, A3854, Sigma-Aldrich), 
chicken anti-GFP (1:5000, GFP-1010, Aves), and 
mouse monoclonal anti-FLAG M2 (1:1000, F1804, 
Sigma-Aldrich). 

Dot blot analyses were performed according 
to manufacturer’s protocol with a BioDot SF 
apparatus (Bio-Rad). Serial dilutions of super- 
natant were spotted onto a nitrocellulose mem- 
brane (GE Healthcare). After drying, the membrane 
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was incubated with the antibodies as described 
above. For the coimmunoprecipitations assays, 
HEK293T were collected 48 hours after trans- 
fection from six-well plates and resuspended 
after two PBS washes in lysis buffer [150 mM 
NaCl, 25 mM Tris (pH 7.5) and 1% IGEPAL CA- 
630 (Sigma-Aldrich)] containing EDTA-free pro- 
tease inhibitor cocktail (Roche) for 30 min at 4°C. 
After centrifugation at 20000 g for 10 min, the 
supernatant was incubated with 20 ul of anti- 
FLAG M2 affinity gel (A2220, Sigma-Aldrich) 
overnight at 4°C. Beads were washed five times 
with the lysis buffer and boiled in 2x Laemmli 
Sample buffer. For GST pull-downs, the coding 
sequence of Gpr124 ICD and its AETTV variant 
were fused downstream of GST sequences into 
pGEX-6P1 (GE Healthcare). The GST-fusion pro- 
teins were induced in Escherichia coli BL21 by 
exposure to 0.1 mM of IPTG in LB medium at 
37°C for 4 hours. Cells were subsequently lysed 
in a cell disruptor (M110S, Microfluidics) and 
after centrifugation, the GST-fusion protein from 
the supernatant were immobilized on Glutathione 
Sepharose beads (17-0756-01, GE Healthcare). The 
Dvl-GFP and GFP constructs were transcripted 
using the mMESSAGE mMACHINE SP6 Tran- 
scription Kit (Thermo Fisher Scientific) and in 
vitro translated using the Rabbit Reticulocyte 
Lysate System provided by Promega (L4.960) in 
the presence of S*-methionine (NEGOO9TOOIMC, 
PerkinElmer). The beads were incubated with 
the radiolabeled proteins in PBS for 3 hours at 
4°C with gentle mixing and then washed five 
times with cold PBS. The bound complex was 
resuspended in 2x Laemmli buffer, separated by 
SDS-page and analyzed by fluorography. 


Microscopy and images processing 


Cells and zebrafish embryos were imaged with 
a LSM710 confocal microscope and images were 
processed in ImageJ. Images of eye fusion pheno- 
types were taken on a Leica M165 FC. Brain vas- 
culature renderings were generated using Imaris 
software (BitPlane). 

The following method was used for the quan- 
tification of the PLA signal: The in situ PLA signals 
are seen as bright fluorescent dots of character- 
istic appearance. The signal lined the cell surface, 
as expected given the membrane localization of 
the receptors. For quantification, several images 
were acquired on LSM710 confocal microscope 
with a 20x objective. ImageJ was used to generate 
binary images by global thresholding and to de- 
termine the ratio of the total area occupied by the 
PLA-positive pixels to the DAPI-positive pixels, 
the latter being indicative of the number of cells 
in the field of view. The threshold values were 
manually adjusted to reflect the nuclear localiza- 
tion of DAPI and the focal membrane PLA signals. 
Identical threshold values were used for all images. 
On the dot plot, each dot represents the ratio of one 
image. Each image typically contained 50-100 cells. 


Structural modeling 


The structure of Xenopus Wnt8a (PDB ID: 4FOA) 
(13) was used as a starting model for Wnt7a. 
Missing residues and substitutions were mod- 
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eled using the program Modeller (50). The mod- 
eling strategy included accounting for existing 
disulphide bridges as observed in XWnt8a. The 
initial best models were subjected to a conjugate- 
gradient energy minimization in vacuum with 
the Ca restrained and then freed in a second 
minimization step. These models were then em- 
bedded in a water box and electric neutrality was 
achieved by adding Na* counter ions at 150 mM. 
The whole system was again energy minimized 
in 3000 steps. The molecular dynamics simu- 
lation was carried out for 0.5 ns with the pro- 
gram NAMD 2.7 at constant temperature (310 K) 
and constant pressure (1 atm), with periodic 
boundaries and using CHARMM36 as force field 
(51). A time step of 2 fs was used to integrate the 
equations of motion. The short-range interac- 
tions were cut at 12 A and the smooth-particle 
mesh Ewald method was used to calculate elec- 
trostatic interactions. Hydrogen atoms were 
constrained using the SHAKE algorithm. The 
resulting model is an average representation of 
the stable simulation. 


Recombinant Reck-Fc fusions and 
synthetic peptides 

The HA-tagged CK domain of Reck and its indi- 
vidual CK motif deletion variants were fused 
at the N terminus of the Fc region of human 
IgGl. The fusion vector was kindly provided by 
J. Nathans (24). Fc fusion proteins were recovered 
in serum-free FreeStyle 293 Expression Medium 
(Thermo Fisher Scientific) from the supernatant 
of HEK293T cell cultures 72 hours post transfec- 
tion with Lipofectamine 2000 (Thermo Fisher 
Scientific). After collection, the supernatants 
were submitted to Protein G affinity purifica- 
tion (Protein G Sepharose 4 Fast Flow, Sigma- 
Aldrich). After acidic elution, protein purity was 
assessed by Coomassie Blue staining (Fig. 3A). 
Synthetic Wnt linker peptides were obtained from 
Chinapeptide Co., Ltd at purities of over 90%. 


Isothermal titration calorimetry 


ITC titrations were carried out on an Affinity ITC 
(TA Instruments). Prior to the measurement, Reck- 
CK-Fc and all Reck-CK-Fc fusions were dialyzed to 
Tris-NaCl buffer (50 mM Tris pH8, 300 mM NaCl). 
In each case Wnt-derived peptides were prepared 
with buffer from the last step of protein dialysis. 
The samples were filtered and degassed before 
being examined in the calorimeter and the titra- 
tions were performed at 25°C. All the experiments 
consisted of injection of constant volumes of 2 uL 
of titrant into the cell (200 uL) with a stirring rate 
of 75 rpm. Nominal sample concentrations were 
between 20 uM and 40 uM in the cell and 400 pM 
to 1.0 mM in the syringe. Actual sample concen- 
trations were determined after dialysis or buffer 
exchange by measurement of their absorption at 
280 nm or by the BCA method. All data were 
analyzed using the MicroCal Origin ITC 7.0 and 
NanoAnalyze software packages. 


Atomic force microscopy 


Glass coverslips coated with a thin gold layer 
were cleaned in an ultraviolet radiation and 
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ozone (UV-O) cleaner (Jetlight) for 15 min and 
subsequently immersed overnight in an ethanol 
solution containing 1 mM 16-mercaptododeca- 
hexanoic acid and 1-mercapto-l-undecanol at 
a 1:99 volumetric ratio. Substrates were then 
rinsed with ethanol, dried with N» and added to 
a solution containing equal volumes of 20 mg ml 
N-hydroxysuccinimide (NHS) and 50 mg ml” 
1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide 
(EDC) for 30 min. The obtained NHS activated 
surfaces were rinsed with ultrapure water and 
incubated with 100 ul of a 10 ug mI Protein G 
solution for 1 hour at RT. Samples were then 
washed with washing buffer (3 x 5 min) and 
further incubated with 100 ul blocking buffer 
for 1 hour at RT. Finally, 50 ul of a 0.2 ug ml? 
Reck-CK-Fe solution was added to the sub- 
strates for 1 hour, rinsed with washing buffer 
and subsequently used for AFM experiments. 
Single-molecule force spectroscopy (SMFS) mea- 
surements were performed in PBS buffer at RT 
using a Nanoscope VIII Multimode AFM (Bruker). 
Triangular AFM cantilevers (MSCT, Bruker) with 
silicon nitride tips and a nominal spring constant 
between 0.01-0.06 N m™ were used. Cantilevers 
were calibrated at the end of each experiment 
using the thermal noise method (52). Function- 
alized tips were derivatized using a NHS-PEG.,- 
Malemide linker following a protocol described 
elsewhere (53) to covalently attach the Pep7b 
linking the cysteine residue added at the C ter- 
minus. After functionalization, the cantilevers 
were washed with PBS (3 x 5 min) and stored in 
individual wells of a multiwell dish containing 
2 ml of PBS per well at 4°C until used in AFM 
experiments. Force-distance curves were recorded 
as 32x32 pixels arrays over 1 x 1 um? areas, using 
an applied force of 250 pN, a contact time of 0.25 s 
and a constant approach and retraction speed 
of 1m s*. For dynamic force spectroscopy mea- 
surements, the retraction speed of the cantilever 
was varied as follows: 20 nm s‘, 100 nm s”’, 
200 nms77,lums?,2ums',10 ums and 
20 um s |. Typically, at least 2000 force-distance 
curves were performed for each cantilever at a 
particular retraction speed. The collected data 
were analyzed using the Nanoscope Analysis 
software (Bruker). The retraction segment of 
each curve was analyzed and unbinding events 
were considered as specific if they occurred at a 
distance between 5-50 nm from the contact 
point. The minimum adhesion force was further 
used to calculate binding probabilities and build 
force distribution histograms. To reconstruct the 
energy landscape of the measured interactions, 
loading rates were calculated from the force vs 
time curve, as the slope of the adhesion event 
before the tip cantilever jumps off to surface. The 
dependency of the force with the loading rate 
was then plotted in dynamic force spectroscopy 
plots. 


Statistical analysis 


Statistical analysis was performed using GraphPad 
software. Data represent mean + SD. p-values 
were calculated by the one-way ANOVA (post 
hoc Dunnett’s test) and Student’s ¢ test for 
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multiple and single comparisons of normally 
distributed data (STF and PLA assays) and by 
the Kruskal-Wallis (post hoc Dunn's test) for 
multiple comparisons of non-normally distrib- 
uted data (CtA quantifications); *P < 0.05; **P< 
0.01; ***P < 0.001. 
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Stereodivergent synthesis of 
1,4-dicarbonyls by traceless charge- 
accelerated sulfonium rearrangement 


Dainis Kaldre*, Immo Klose*, Nuno Maulidet 


The chemistry of the carbonyl group is essential to modern organic synthesis. The 
preparation of substituted, enantioenriched 1,3- or 1,5-dicarbonyls is well developed, as 
their disconnection naturally follows from the intrinsic polarity of the carbonyl group. By 
contrast, a general enantioselective access to quaternary stereocenters in acyclic 
1,4-dicarbonyl systems remains an unresolved problem, despite the tremendous 
importance of 2,3-substituted 1,4-dicarbonyl motifs in natural products and drug scaffolds. 
Here we present a broad enantioselective and stereodivergent strategy to access acyclic, 
polysubstituted 1,4-dicarbonyls via acid-catalyzed [3,3]-sulfonium rearrangement 

starting from vinyl sulfoxides and ynamides. The stereochemistry at sulfur governs the 
absolute sense of chiral induction, whereas the double bond geometry dictates the relative 


configuration of the final products. 


uch of organic synthesis revolves around 

the chemistry of the carbonyl group. Con- 

catenations of more than one carbonyl 

function are given special attention and 

referred to by the relative disposition of 
the two moieties (e.g., 1,3; 1,45 or 1,5), along with 
strategies for their direct preparation. Highly effi- 
cient approaches exist for the preparation of sub- 
stituted, enantioenriched 1,3- or 1,5-dicarbonyls, 
as their disconnection naturally follows from the 
intrinsic polarity of the carbonyl group. By con- 
trast, the 1,4-dicarbonyl pattern remains chal- 
lenging to access, even though 2,3-substituted 
1,4-dicarbonyl motifs are commonly found in 
numerous natural products (7) and drug scaf- 
folds (2, 3) and are key synthons for many named 
reactions in organic chemistry (4, 5) (Fig. 1A). 
Most of the current methods for the synthesis of 
these motifs involve direct formation of the cen- 
tral, C2-C3 bond via oxidative coupling (/, 6-8) 
or Umpolung strategies (9, 10) involving enolate 
alkylation (17) (Fig. 1B). However, whereas the 
first family of methods fails to properly address 
stereoselectivity, the second one is limited in 
structural flexibility or requires multistep prep- 
aration of two chiral starting materials while being 
only moderately diastereoselective. Rearrangements 
have been used to access 1,4-dicarbonyls (72, 13), 
but these methods are strongly limited to certain 
molecular scaffolds. A general enantioselective 
route to quaternary stereocenters in acyclic 


Fig. 1. Relevance and synthesis of 
1,4-dicarbonyls. (A) 1,4-dicarbony! motifs in 
bioactive substances. (B) Current strategies and 
common limitations. (©) Highly modular and 
stereoselective synthesis of vinyl sulfoxides. 

(D) Stereodivergent approach for the enantio— and 
diastereoselective synthesis of 1,4-dicarbonyls. 


Kaldre et al., Science 361, 664-667 (2018) 


1,4-dicarbonyl systems remains an open problem 
C, 14-16). 

Here we report a broad strategy to access acyclic, 
polysubstituted 1,4-dicarbonyls via charge-accelerated 
sulfonium rearrangement (17-19). Our approach 


uses highly enantioenriched alkenylsulfoxides— 
readily available substrates in two steps from com- 
mercially available menthyl sulfinates (Fig. 1C) 
(20-23). In combination with ynamides (24) and 
a Bronsted acid catalyst, these undergo a [3,3]- 
sulfonium rearrangement to form a thionium 
intermediate that is hydrolyzed in situ to give the 
respective aldehydes or ketones in a traceless man- 
ner (Fig. 1D). This catalytic approach allows the 
preparation of tertiary and quaternary centers to 
access each and every diastereomer and enantio- 
mer of the 1,4-dicarbonyl products at will and with 
high stereopurity. 

Our investigation started with the use of 
(E)-vinyl sulfoxide la (22, 23), which enabled 
the selective formation of syn-2,3-disubstituted 
1,4-dicarbonyls (Fig. 2A). During optimization 
studies, both the addition of water to the reaction 
mixture and the use of oxazolidinone-derived 
ynamides afforded superior results (see supple- 
mentary materials for more details). Several ali- 
phatic ynamides afforded the desired aldehydes 
in good yields and high diastereomeric ratio (d.r.), 
and the stereoselectivity was further improved 
when aromatic ynamides were used (2d, 2e). 
Numerous base-sensitive functional groups such 
as esters (2), nitriles (2g), imides (2h), and ketones 
(2i), as well as primary chlorides, aldehydes, and 
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A E-vinylsulfoxide gives Syn-1,4-dicarbonyls 


fe) er" 0 ot . equiv) L (e) Rs 
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ee ae ene aE ee ee See tea ESTO ES | ane ae te ene ne ee eee ER CE ee Pee 
Me Me Me Me Me Me 


e) 
2a 2b 2c 2d cl 2f 
85% (72%) 78% (58%) 87% (60%) 54% (36%) 2e 81% (66%) 
dr. 8:4 dir. 8:1 dir. 9:1 dir. 13:1 66% (52%) dr. 7:4 
ee > 98% ee > 98% ee > 98% ee > 98.5% dr. 12:1, ee > 99% ee > 96% 
Me Me Me X-Ray crystal structure ie Me 
fo) pi fe) s fe) Pi oN Me MeN Me fe) J fe) J 


fe) : fe) : fe) : ) : fe) ‘ fc) : ) : 
ony gon gon gon raaanal corey NOW 
fe) % 
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> Oo > 7)_ O - @ = j, 2 ; 2 
' arn Gar a OO 
nZ Ph~ So Ph Ph~ SO Me Ph Ph 
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2g 2h 2i 2j 2k al 2m* 
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dr 8:1 d.r. 8:1 dir. 7:1 d.r. 16:1 dir. 15:1 d.r. 10:1 d.r. > 20:1 
ee > 98% ee > 95% ee > 95% ee > 99% ee > 99% ee 93.5% 


B: Z-vinylsulfoxide gives Anti-1,4-dicarbonyls 
TfaNH (0.35 equiv) 
H20 (3.0 equiv) 


CHoCle, 0 °C, 2.5 h 


20 2p 2r 2s* 
75% (68%) 80% (70%) 70% (56%) 62% (55%) 60% (53%) 76% 
dir. 8:1 dir. 8:1 dir. 8.5:1 dr. 8:1 dir. 14:1 d.r. 14:1 
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anti Lg J — Z-sulfoxide qN HALAL anti 
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(2S,3S)-2a ent-(2R,3R)-2a 
90% (72%) 76% (71%) 
dir. 8:1 d.r. 8:1 
ee 98% ee > 98% 


Fig. 2. Substrate scope. (A) syn-1,4-dicarbonyls. (B) anti-1,4-dicarbonyls. 
(C) Access to all possible stereoisomers of a 1,4-dicarbonyl product. Unless 
otherwise indicated, reactions were run on 0.1- to 0.2-mmol scale. Yields were 
determined by #H-NMR (nuclear magnetic resonance) using an internal 
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standard (isolated yields shown in parentheses). Diastereomeric. ratios were 
determined by H-NMR analysis of the crude product. Enantiomeric excess (ee) 
determined via high-performance liquid chromatography (HPLC). * Isolated 
yield. Modified conditions used, see supplementary materials for details. 
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A Vinyl sulfoxide geometry dictates relative and absolute configuration for all-carbon quaternary stereocenters 


mOct 
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e) Me 
oct’? py 9 


R H20 (3 equiv 


E-sulfoxide 


R= Me, 2t 91% (60%), d.r. 6:1, ee >98% 
R = Ph, 2v 79% (71%), d.r. 8:1, ee >98% 


B Scope of ynamides 


2x 2y 
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8 QO N 
Oe R  ThNH (0.2 equiv) 
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Ph 


Fig. 3. Synthesis of all-carbon quaternary products. (A) Preparation of 
quaternary centers. (B) Scope of ynamides. (C) Scope of vinyl sulfoxides, 


including fluorine and trifluoromethy! derivatives. Un 


alkynes (see supplementary materials), were all 
well-tolerated. Sterically more demanding sub- 
stituents on the sulfoxide greatly enhanced the 
stereoselectivity, delivering the desired products 
with high diastereoselectivity (2j and 2k). When 
using ynamides with a chiral auxiliary, we ob- 
served matched-mismatched pairings leading to 
diastereomeric ratios of 10:1 (matched, compare 
21) and 3:1 (mismatched, compare supplemen- 
tary materials) respectively. The products were 
isolated as free aldehydes ready for further func- 
tionalization (see below), and ketones (2m) could 
also be accessed using an o-substituted sulfoxide 
(Ro # H). 

Conversely, the use of (Z)-sulfoxides resulted 
in the formation of anti-1,4-dicarbonyl products 
(Fig. 2B). In this case, the use of alkyl vinyl 
sulfoxides led to increased yields and stereo- 
selectivities (see supplementary materials for 
details). Consistently high diastereo- and enan- 
tioselectivities were observed. Thus, all four 
possible isomers (2a, 2n, and their respective 
enantiomers) of a 1,4-dicarbonyl product are 
accessible by this stereodivergent approach, in 
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ess otherwise indicated, 


high yield and stereoselectivity, by simply switch- 
ing between double bond geometry and sulfoxide 
stereoisomers, as demonstrated in Fig. 2C (25-29). 

In our stereochemical model, the charge- 
accelerated sulfonium rearrangement is consistent 
with a chair-like transition state. The (£)-vinyl 
sulfoxides bias all substituents into a pseudoequa- 
torial orientation, whereas their (Z)-counterparts 
mandate that the R, substituent (Fig. 2C) occupies 
a pseudoaxial orientation directing the substitu- 
ents to an anti-relationship in a final 1,4-dicarbonyl 
structure. The stereochemistry at sulfur governs 
the absolute sense of chiral induction, whereas 
the double bond geometry in turn dictates the 
relative stereochemistry of the final products. 

Having successfully demonstrated diastereo- 
and enantiodivergence, we turned our attention 
to B,8-disubstituted alkenylsulfoxides to access 
all-carbon quaternary stereocenters in a stereo- 
selective fashion. As shown in Fig. 3, A and B, the 
easily controlled sulfoxide double bond geometry 
correspondingly induces formation of the quater- 
nary carbon stereocenter with high diastereose- 
lectivity and perfect enantiocontrol. 


018 
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di 71 
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58% (50%) 
d.r. 6:1 
ee >97% 


reactions were run on 0.1- to 0.2-mmol scale. Yields were determined by 
H-NMR using an internal standard. Isolated yields in parentheses. d.r. ratios 
were determined by H-NMR analysis of the crude product. 


Exploring this approach further, we were able 
to install isopropyl (2ad), alkynyl (2ae), fluoro 
(-F) (2af), and trifluoromethyl substituents (2ag) 
on the newly formed quaternary center, all of 
which usually would require their own synthetic 
strategy (Fig. 3C) (30-33). 

Enantioenriched polysubstituted 1,4-dicarbonyls 
are important intermediates in synthesis. For 
example, diastereoselective nucleophilic additions 
to the aldehyde moiety easily afforded trisubsti- 
tuted lactone 3 (Fig. 4A) (34) or the two-step pro- 
tocol afforded y-oxobutyric acid building block 4 
(35). A direct comparison of our method to state- 
of-the-art enolate coupling highlights its advan- 
tages, in that both stereoisomers become available 
at will and in high purity under comparably mild, 
catalytic conditions. Several highly potent MMP 
inhibitors bear a 2,3-disubstituted 1,4-dicarbonyl 
backbone, and the simple access to fully function- 
alized succinate 6 with excellent stereocontrol (Fig. 
4B) is representative of the synthetic value of this 
method (2, 36). That the fully annotated precursor 5 
is elaborated in a single-step from the unconventional 
building blocks 1d and 7 is a hallmark of this strategy. 
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A Diastereoselective transformations 
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Fig. 4. Applications. (A) Diastereoselective transformations. (B) Direct stereoselective access to succinate building blocks. Yields refer to isolated 
material. d.r. ratios were determined by 1H-NMR analysis of the crude product. 
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Selective functionalization of 
methane, ethane, and higher alkanes 
by cerium photocatalysis 


Anhua Hu", Jing-Jing Guo*, Hui Pan, Zhiwei Zuot{ 


With the recent soaring production of natural gas, the use of methane and other light 
hydrocarbon feedstocks as starting materials in synthetic transformations is becoming 
increasingly economically attractive, although it remains chemically challenging. We report 
the development of photocatalytic C-H amination, alkylation, and arylation of methane, 
ethane, and higher alkanes under visible light irradiation at ambient temperature. High 
catalytic efficiency (turnover numbers up to 2900 for methane and 9700 for ethane) and 
selectivity were achieved using abundant, inexpensive cerium salts as photocatalysts. 
Ligand-to-metal charge transfer excitation generated alkoxy radicals from simple alcohols 
that in turn acted as hydrogen atom transfer catalysts. The mixed-phase gas/liquid 
reaction was adapted to continuous flow, enabling the efficient use of gaseous feedstocks 


in scalable photocatalytic transformations. 


ethane and other gaseous alkanes (C, to 

C,) have traditionally been viewed more 

as fuels than as economical chemical 

feedstocks. However, the recent discov- 

ery of huge volumes of unconventional 
reservoirs and the ensuing soaring production of 
natural gas have made these gaseous hydro- 
carbons economically attractive and strategically 
important basic raw materials (7, 2). With the 
added economic and ecological benefits of reduc- 
ing transportation costs and emissions, the direct 
transformation of gaseous hydrocarbons into 
value-added liquid commodity chemicals via the 
use of innovative homogeneous catalysis pro- 
cesses has received considerable attention in re- 
cent years (1, 3-8). 

The intrinsic inertness of C-H bonds in meth- 
ane and other gaseous alkanes has, however, 
brought extreme challenges for catalytic systems, 
not only in the activation step, but also in con- 
trolling chemoselectivity to avoid solvent func- 
tionalization and overfunctionalization under the 
frequently harsh conditions necessary (high tem- 
perature, superacid media, strong oxidants) (9). 
Moreover, the gaseous substrates’ low solubility in 
most solvents has raised substantial practical dif- 
ficulties. Elegant catalytic systems using transi- 
tion metals such as Pd, Ir, Rh, and Ru have been 
reported recently (5, 10-12); however, the chal- 
lenge remains to develop efficient Earth-abundant 
catalytic systems that operate under ambient 
conditions (J). 

Photoredox catalysis has recently emerged as a 
powerful platform for the direct activation and 
functionalization of organic molecules via open- 
shell pathways (13). Hydrogen atom transfer 
(HAT) by photocatalytically generated amine and 
aminium (74-17), thiyl (18), and other heteroatom- 
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centered radicals (19, 20) has recently enabled 
myriad otherwise unattainable C-H functional- 
ization transformations under mild reaction 
conditions (27). The unique reactivity and polarity- 
governed selectivity (22) inherent in the abstrac- 
tion events by these heteroatom-centered radicals 
inspired us to question whether a photocatalytic 
HAT strategy could be extended to the most chal- 
lenging C(sp*)-H transformations: the selective 
functionalization of methane and other gaseous 
alkanes. The nucleophilic character of alkyl rad- 
icals would provide strategic advantages for the 
development of diverse selective functionalizations, 
complementary to transition metal-catalyzed 
reactions, under mild conditions. Although gas- 
phase HAT events between methane and open- 
shell metal-oxo species have been extensively 
investigated for oxidative couplings of methane 


H 5 
iS 
H H O—R 
M K7 


C, - Cy light alkanes alkoxy radicals 


natural/shale gas highly electrophilic 


HAT H.@ oH 


at elevated temperature (23), room-temperature 
light alkane functionalization via HAT photo- 
catalysis in a homogeneous system remains large- 
ly unresolved. 

Ligand-to-metal charge transfer (LMCT) is a 
common photoexcitation manifold among coor- 
dination complexes of transition metals with an 
empty valence shell (24), which has nevertheless 
been underinvestigated and underutilized in syn- 
thetic organic transformations via modern photo- 
redox catalysis (25, 26). We recently applied 
LMCT catalysis to the direct activation of alcohol 
feedstocks, enabling alkoxy radical-mediated skel- 
etal rearrangement of cyclic alcohols (27) and 
remote C-H functionalization of primary alcohols 
via 1,5-HAT (28). Using absorbed light energy to 
promote targeted oxidation via the homolysis of 
the transiently coordinated Ce(IV)-alkoxide, chal- 
lenging oxidations of a wide range of alcohols 
have been accomplished under mild and opera- 
tionally simple conditions with cerium(IID salts 
as precatalyst. Recognizing that the ready abun- 
dance of cerium would make it an ideal catalyst 
for sustainable, large-scale photocatalytic systems 
(29, 30), we hypothesized that the synergistic 
merger of LMCT and HAT catalysis, via the use of 
inexpensive cerium salts in concert with simple 
alcohols such as methanol or 2,2,2-trichloroethanol, 
could productively activate and functionalize inert 
alkanes such as methane and ethane (Fig. 1). 
Indeed, highly electrophilic alkoxy radicals 
[CH30OH, O-H bond dissociation energy (BDE) = 
105 kcal/mol] would be thermodynamically viable 
for the abstraction of C-H bonds from methane 
(C-H BDE = 105 kcal/mol) (37). Conceivably, 
the polarity-matching effect would render the 
hydrogen atom transfer with the ubiquitous, weak 
and acidic C-H bonds of solvent (CH3CN, C-H 
BDE = 93 kcal/mol) kinetically disadvantageous, 
thereby subverting the frequently encountered 
problem of competitive solvent functionalization 
in methane functionalization. Critically, the use of 
inexpensive and widely available alcohols as HAT 


R = CHs, Methanol 
R =CH2CClg, TCE 
R = CH2CF3, TFE 


alkyl radical 


nucleophilic, reactive 


cerium catalyst, alcohol catalyst 


radical coupling reagents 


CH, H3C=FG 
gaseous ambient temperature ‘Q) value-added 
feedstocks LMCT, HAT catalysis v products 


@ Abundant, inexpensive catalysts 


Bi Mild reaction conditions 


i Diversified functionalizations 


Fig. 1. Alkoxy radical-mediated HAT activation enables diverse selective functionalizations 
of gaseous alkanes. The synergistic merger of LMCT and HAT catalysis would offer opportunities 
for the use of methane and other gaseous alkanes under economical ambient conditions. TCE, 
2,2,2-trichloroethanol; TFE, 2,2,2-trifluoroethanol; FG, functional group. 
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catalysts would allow easy tunability of reactivity 
and selectivity (32). Here, we report the applica- 
tion of LMCT-enabled HAT catalysis as a general 
method for the catalytic and selective radical 
coupling of methane, ethane, and several other 
simple alkanes. 

We sought first to apply our LMCT/HAT func- 
tionalization to the amination of methane, a 
largely elusive transformation (33). The direct 
transformation of methane into liquid tert- 
butyloxycarbonyl (Boc)-protected monomethylhy- 
drazine (MMH)—an important rocket propellant 
and valuable building block for heterocyclic com- 
pounds, currently produced in industrial quantities 
via oxidation of methylamine—would upgrade 
methane from an abundant feedstock into a 
useful commodity chemical. A detailed descrip- 
tion of our proposed catalytic mechanism is 
outlined in Fig. 2. We envisioned that a Ce(IV)- 
alkoxy complex, catalytically generated in situ 
from simple alcohols and a Ce(IV) salt, would 
undergo photoinduced LMCT to generate a high- 
energy electrophilic alkoxy radical and a reduced 
Ce(III) species. The alkoxy radical would then 
abstract an H atom from a C-H bond in the sub- 
strate alkane to generate an alkyl radical species, 
which would then readily couple with di-tert-butyl 
azodicarboxylate (DBAD) to forge a new C-N bond 
with the formation of a stable captodative N- 
centered radical. Single electron reduction of 
this radical by the reduced form of the cerium 
catalyst would regenerate the active Ce(IV) and 
deliver the desired product after protonation. 

In practice, the photocatalyzed reaction was 
carried out in a standard pressure reactor equipped 
with a sapphire window in the top plate to 
transmit 400-nm LED irradiation (see fig. S2). At 
ambient temperature under 5000 kPa of meth- 
ane with DBAD as the limiting reagent, we ob- 
served that using 0.5 mole percent (mol %) CeCls, 
in concert with catalytic 2,2,2-trichloroethanol, 
the desired Boc-protected MMH product was 
formed in 39% yield (table S6, entry 1; see tables S6 
to S10 for detailed studies). Among several com- 
mercial cerium salts we evaluated in combination 
with exogenous soluble tetrabutylammonium 
chloride, cerium(IV) trifluoromethanesulfonate 
gave slightly higher efficiency (Table 1, entry 1); 
the premade Ce(IV) chloride complex (34) also 
proved effective in this catalytic system (table S6, 
entry 2). No overfunctionalization product was 
observed in any cases, and the product proved to 
be inert when subjected to the reaction condi- 
tions (fig. S7). Moreover, the alcohol HAT catalyst 
2,2,2-trichloroethanol was left largely intact at 
the end of the reaction. The major unproductive 
side pathway was the reduction of DBAD to the 
corresponding hydrazine. An increase in absolute 
yield could be obtained by using D;-acetonitrile, 
affording di-Boc-MMH in 63% yield, as a smaller 
amount of reduced hydrazine by-product was 
formed (Table 1, entry 2). Moreover, no significant 
decrease in efficiency was observed when the 
reaction was performed at 0°C (table S9, entry 2), 
highlighting the highly reactive nature of the rad- 
ical coupling system. Lowering the catalyst load- 
ing to 0.1 mol % or even 0.01 mol % increased the 
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Fig. 2. Proposed mechanism for the cerium-catalyzed C(sp*)-H functionalization of methane 
and other gaseous alkanes. Highly electrophilic alkoxy radicals, generated from simple alcohols via 
photoinduced LMCT, are used for the HAT activation of challenging C—H bonds in light alkanes. di- 


Boc-MMH, di-tert-butyloxycarbonyl monomethylhydrazine. Me, methyl; Et, ethyl; Pr, propyl. 


C-H Alkylation 
5 mol% (n-Bu4N)CeCle CO.Et 
4 CO.Et 20 mol% CCIgCH2OH, 40 mol% TFA CO.Et 
HH EtO,C. 2 = He 
H CO-Et CH3CN (0.01M), 400 nm LEDs, rt CO,Et 
5000 kPa 1 2 56% yield 
5 mol% (n-Bu,N)sCeClg CO.Et 
4 CO,Et 50 mol% CClgCH,OH COE 
Hoon EtO,C. > Et 
H CO-Et CH3CN (0.025M), 400 nm LEDs, rt CO,Et 
atmospheric pressure 1 3 90% yield 
C-H Arylation 
5 mol% CeCls, 50 mol% CClgCH20H YQ 
f S 25 mol% TBACI, 2 equiv. (NH4)2S20g 
HoH i = ZN 
H f 2 equiv. TFA, CH4CN (0.01M) 
CHg 
400 nm LEDs, rt 
5000 kPa 4 5 19% yield 
41% recovered 4 
5 mol% Ce(SO,)s, 50 mol% CClszCH2OH 
H NN 25 mol% TBACI, 2 equiv. (NH4)2S20g = 
H tn ea. oes aN 
A ZN 2 equiv. H2SO,, CH3CN (0.025M) 
400 nm LEDs, rt Et 
atmospheric pressure 4 6 62% yield 


Fig. 3. Photocatalytic alkylations using methane and ethane. 


turnover numbers (TONs) to 320 and 2900, re- 
spectively, although this necessitated longer re- 
action times and resulted in minimally decreased 
yield (Table 1, entry 3, and table S8, entry 3). Con- 
trol experiments indicated the necessity of cerium 


catalyst, alcohol catalyst, and LED irradiation, as 
omission of any of these resulted in no desired 
amination product (table S1). 

The catalyst combination was also highly effec- 
tive for ethane functionalization (see tables S11 
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Table 1. Cerium-catalyzed amination of alkanes. Yields and regiomeric ratios (r.r.) of all products were determined by gas chromatography analysis with 


internal standard. Regiomeric ratios are presented as 1°/2° product ratios. See full details in supplementary materials. 


*With 40 mol % trifluoroacetic acid. 


tin D3-acetonitirle, in the absence of TBACI and trifluoroacetic acid. {With diisopropyl azodicarboxylate (DIAD). §At 1000 kPa of ethane. ||With 50 mol 
% alcohol catalyst. rt, room temperature; TBACI, tetrabutylammonium chloride; DBAD, di-tert-buty! azodicarboxylate. 
nmol% cerium catalyst 
20 mol% alcholol catalyst 
H 5nmol% TBACI, 1 equiv. DBAD er a? NHBoc 
Ee 
| 
cm CH3CN, rt, 400nm LEDs Boc 
Alkane Entry Cerium catalyst Loading (n) Alcohol catalyst Time Yield TON 
1 Ce(OTf), 0.5 CCI,;CH,OH 2h 45% 90 
R =H" 
at (n-Bu,N) 2CeCl, 0.5 CCl,CH,OH 2h 63% 126 
Methane (5000 kPa) 
3 Ce(OTf)4 0.01 CCl,CH2OH 18h 29% 2900 
R =Mez 4 CeCls 0.5 CCI3CH20H 4h 74% 148 
Ethane (101 kPa) 58 CeCl, 0.01 CClgCH2OH 4h 97% 9700 
6 CeCl, 0.5 CCl,CH2OH 9h 70% (1:1 rr.) 140 
RSet 
7 CeClg 0.5 CF3CH,OH 12h 61% (1:1 rr.) 122 
Propane (101 kPa) 
g! CeCl, 0.5 CH,OH 19h 39% (1:3.9 rr.) 78 
9 CeCly 0.5 CClgCH2OH 6h 76% (1:1.7 rr.) 152 
Ri=iPh 
10 CeCl, 0.5 (CH3)>CHOH 18h 40% (1:4 rr.) 80 
Butane (101 kPa) 
et CeCls 0.5 CH30H 18h 72% (1:8 rr.) 144 
Cyclohexane 12 CeCls 0.5 CClgCH20OH 16h 81% 162 


and S12 for detailed studies). Because of ethane’s 
higher solubility in acetonitrile relative to meth- 
ane, the reactions could be performed in regular 
vials under atmospheric pressure. Satisfactory 
yield and TONs were obtained using 0.5 mol % 
CeCl; and 20 mol % 2,2,2-trichloroethanol (Table 1, 
entry 4). Furthermore, lowing the cerium catalyst 
loading from 0.05 mol % to 0.01 mol % led to 
catalytic TONs up to 6900 under ambient tem- 
perature and pressure conditions (table S12, 
entry 3). Additionally, increasing the pressure of 
ethane to 1000 kPa led to a remarkable rate 
acceleration, yielding 97% of aminated product 
in just 4 hours, at 0.01 mol % loading of cerium 
catalyst, corresponding to a TON of 9700 (Table 1, 
entry 5). 

Propane and butane, the major components 
of liquid petroleum gas (LPG), a commonly used 
fuel, present interesting challenges with respect 


Hu et al., Science 361, 668-672 (2018) 17 August 2018 


to selectivity of C-H bond functionalization 
(35, 36). We hypothesized that the regioselectiv- 
ity of the C-H bond abstraction step could be 
tuned by modification of the alcohol HAT cat- 
alyst (22, 32, 37, 38). In both propane and butane, 
the methylene positions possess slightly weaker 
C-H bonds, whereas the methyl positions possess 
more numerous C-H bonds. We were pleased to 
observe that with low loading of CeCl; and simple 
alcohols (2,2,2-trichloroethanol or 2,2,2-trifluoro- 
ethanol), propane could be functionalized to 
provide aminated products with high efficiency, 
although in a nonselective manner (Table 1, 
entries 6 and 7). Using methanol as the HAT cat- 
alyst, the regioselectivity improved to 1:3.9 (Table 1, 
entry 8), favoring the methylene position, although 
the reaction proceeded at a much lower rate. 
This selectivity could arise from the electronic 
difference in the electrophilic nature of methoxy 


radical relative to halogenated ethoxy radicals. In 
a similar vein, butane could be functionalized with 
high efficiency, and the regioselectivity could be 
tuned with different choice of alcohol catalysts to 
favor the methylene positions. When using iso- 
propanol as the HAT catalyst, the 1°/2° product 
ratio could be increased to 1:4 (Table 1, entry 10); 
the lower efficiency was likely due to the decom- 
position of the isopropanol catalyst via alkoxy 
radical-mediated B-scission, as the side product 
of methyl radical coupling with DBAD was ob- 
served in the reaction (table S14, entry 5). Methanol 
proved to be a selective HAT catalyst, providing 
1:8 selectivity for the methylene C-H bonds (Table 1, 
entry 11); this result validated the hypothesis that 
enhancement of regioselectivity could be ac- 
complished through electronic tuning of the 
alcohol HAT catalyst. Pleasingly, liquid hydro- 
carbons such as cyclohexane could be aminated 
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Gas input (G) 


compressed gaseous alkane 


Liquid input (L) 
(n-BuyN)2CeClg (42 mg, 1 mol%) 
CCIgCH2OH (149 mg, 20 mol%) 

DBAD (1.15 g, 5 mmol) 

CH3CN (100 mL) 


Entry Alkane 

if methane (1800 kPa) 
at ethane (1500 kPa) 
3 propane (800 kPa) 
4 butane (400 kPa) 
5? cyclohexane (22 mL) 


with high efficiency using this inexpensive photo- 
catalytic platform at a low cerium loading (Table 1, 
entry 12). 

We next applied this photocatalytic protocol 
to radical alkylation reactions using methane 
and ethane as alkylating feedstocks. As shown 
in Fig. 3, the alkoxy-mediated methylation and 
ethylation of electron-deficient alkene 1 using 
methane and ethane delivered the desired pro- 
duct with high efficiency (56% and 90% yield, 
respectively). Furthermore, methane and ethane 
proved viable pronucleophiles in Minisci arene 
alkylation reactions. The acidic conditions made 
use of diminished product overfunctionalization 
by electronically deactivating the benzylic C-H 
bonds through protonation of the N-heterocycle 
(39). With ammonium persulfate as an eco- 
nomical oxidant, methylated and ethylated iso- 
quinoline could be produced at room temperature. 

Photocatalytic reactions in batch reactors are 
typically limited to small-scale applications be- 
cause of the attenuation effect of superficial light 
penetration (40). In addition to the increase of 
the light utilization efficiency through glass micro- 
reactors, the development of continuous-flow 
photocatalytic systems for large volume reac- 
tions could further benefit from the ease of 
handling gaseous reactants and the enhanced 
mass transfer between gas and liquid phases, 
which are crucial for the utilization of gaseous 
alkanes. Given the attractiveness of flow chemis- 
try for mixed-phase gas/solution reactions, we 
investigated the C-H functionalization of gaseous 
alkanes in continuous-flow microreactors (Fig. 4). 
An acetonitrile solution of reagents and catalysts 
was pumped as a single stream and then directly 
combined and mixed with the gaseous alkane 
feedstock in the microreactor. Using a flow setup 
of 10 parallel microreactors (4.5 ml internal 
volume in total), at a pressure of 1500 kPa for 
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the ethane gas and a flow rate of 0.75 ml/min for 
the liquid solution stream, amination product was 
furnished in 90% yield with a residence time of 
6 min and a production throughput of 2 mmol/ 
hour. Under similar conditions, propane and 
butane were functionalized with good yield 
(76% and 56%, respectively) and productivity 
(1.7 mmol/hour and 1.3 mmol/hour). In our at- 
tempts to use methane in this mixed liquid/gas 
flow reaction setting, we were hampered by the 
pressure limits of our commercial microreactors 
(1800 kPa), leading to a somewhat diminished, 
but promising, 15% yield. As for higher alkanes 
such as cyclohexane, the scaled-up amination 
could be conveniently performed with simple 
liquid injection mode, affording remarkable pro- 
ductivity of 4.2 mmol/hour. 

Further studies were performed to provide 
additional evidence for the intermediacy of alkoxy 
radicals under this catalytic manifold. As shown 
in fig. S11, methoxy, trichloroethoxy, and trifluoro- 
ethoxy radicals could each be trapped by styrene 
to generate 1,2-alkoxyamination products (80%, 
25%, and 59% yield, respectively). In the case of 
2,2,2-trifluoroethanol, a small amount of trifluoro- 
methylated product was also observed, indicative 
of a B-scission pathway and trapping of trifluoro- 
methyl radical. 

This photocatalytic platform has enabled several 
direct transformations of methane and other sim- 
ple hydrocarbons, including amination, alkylation, 
and arylation, and offers intriguing opportunities 
for further functionalizations of feedstock alkanes. 
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Collective clog control: Optimizing 
traffic flow in confined biological and 
robophysical excavation 


J. Aguilar’*, D. Monaenkova”*, V. Linevich’, W. Savoie”, B. Dutta®, H.-S. Kuan*, 
M. D. Betterton’, M. A. D. Goodisman®, D. I. Goldman”+ 


Groups of interacting active particles, insects, or humans can form clusters that hinder the 
goals of the collective; therefore, development of robust strategies for control of such 
clogs is essential, particularly in confined environments. Our biological and robophysical 
excavation experiments, supported by computational and theoretical models, reveal that 
digging performance can be robustly optimized within the constraints of narrow tunnels by 
individual idleness and retreating. Tools from the study of dense particulate ensembles 
elucidate how idleness reduces the frequency of flow-stopping clogs and how selective 
retreating reduces cluster dissolution time for the rare clusters that still occur. Our results 
point to strategies by which dense active matter and swarms can become task capable 
without sophisticated sensing, planning, and global control of the collective. 


the structure of the tunnels seems to benefit 
individuals, physical-model experiments make 
it clear that excavation can suffer as a result of 
clogging during high-traffic conditions [e.g., (74) 
and Fig. 1C]. Here we use biological, theoretical, 
computational, and robophysical systems to 
show that counterintuitive behaviors—individual 
idleness and retreating—help optimize tunnel 
density by limiting the severity and prevalence 
of clogs, thereby enabling rapid excavation by 
the collective. 

In laboratory experiments, we monitored the 
activity of fire ants as they excavated a cohesive 


iverse living (7) and artificial (2) active ma- 

terials (3) and swarms spontaneously form 

clusters that can persist for long durations. 

However, for tasks that demand steady 

flow, such formations can be disadvan- 
tageous: Confined active systems such as pedes- 
trian or vehicular traffic jams (4), competing 
bacterial biofilms (5), high-density migrating 
cells (6), jammed herds (7), and robot swarms 
(8) can produce high-density clogs that read- 
ily form glasslike arrests of flow (9). In such 
systems, the ability to dissolve clusters and pre- 
vent their formation (9), particularly in the ab- 
sence of global knowledge of the state of all 
elements, is crucial. 

Social insects (JO) perform many tasks that 
demand clog minimization and mitigation. Sub- 
strate excavation specialists such as fire ants 
(Solenopsis invicta) cooperatively create nests 
of complex subterranean networks (Fig. 1A) 
consisting of tunnels in soil that support bi- 
directional traffic without lanes (12). Our pre- 
vious laboratory experiments (12) revealed that, 
in the early stages of nest construction, the few- 
millimeter-long ants construct vertical tunnels 
approximately one body length in diameter (73). 
These narrow tunnels benefit the climbing ants 
as they transport bulky pellets, because close 
proximity to walls allows limbs, body parts, and 
antennae to aid slip recovery (72). But although 


Fig. 1. Confined and crowded A 
biological and robotic excava- 
tors. (A) X-ray reconstruction 
of S. invicta fire ant excavation 
in a large container (25 cm 
wide) filled with 240- to 
270-uum-diameter glass particles 
(supplementary materials). 

(B) Painted S. invicta workers 
excavating a single tunnel along 
the wall of a transparent con- 
tainer with 0.25-mm-diameter 
wet glass particles. (©) Auton- 
omous robotic diggers excavat- 
ing in a simulated environment 
with cohesive granular media 
diameter of 1.8 cm). The inset 
shows the number of pellets 
defined as a cohesive group of 
grains) deposited versus time 
T) by a robot excavating alone 
red dots) and the net excava- 
tion of four robots (blue 
circles), whereby each robot 
attempts to excavate maxi- 
mally. Orange dashed line indi- 
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granular medium. Groups of ~30 workers were 
placed in transparent containers containing 
particle-water mixtures (73) consisting of 0.25-mm- 
diameter glass particles (Fig. 2A) with a soil 
moisture content, defined as the ratio of total 
water weight to total solid weight, of 0.01 or 
0.1 (three trials each) (73). Ants excavated for 
48 hours, with individual ants entering and exit- 
ing the tunnel hundreds of times. As in our pre- 
vious study (72), ants constructed narrow vertical 
tunnels by means of a stereotyped process of 
grain and multigrain (pellet) removal and trans- 
port, followed by tunnel ascent and substrate 
deposition upon exit (13). A camera mounted 
to a motorized linear stage tracked a region 
within about three body lengths from the tun- 
nel face (Fig. 2A and supplementary materials). 
We distinguished individual ant activity by mark- 
ing ant abdomens with different colors (Fig. 1B). 
We recorded tunnel length over time (Fig. 2B), 
and the presence of each worker was logged 
when in the camera’s view (Fig. 2C). 

Ants exhibited a variety of behavioral tasks 
during collective excavation. A large fraction 
(0.22 + 0.1 for soil moisture content of 0.01 and 
0.31 + 0.13 for soil moisture content of 0.1) of 
ants never entered the tunnel to excavate during 
the 48-hour period of observation; we refer to 
these as “nonvisitors.” As seen in Fig. 2C, ants 
that visited the tunnel face (“visiting” ants) varied 
in activity level. Inspired by work in honey bees 
(15), we quantified activity inequality among vis- 
itor ants using Lorenz curves. Points on the 
Lorenz curves in Fig. 2D link the cumulative 
fraction of workers in the population to the cumu- 
lative share of activity by that fraction. Although 
visitor ants’ trips did not always result in the ex- 
traction of a pellet (see movie S1 and discussion 


/ 1 robot ,° | 
° 


cates the hypothetical performance of the group of four robots in the absence of confinement. 
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Fig. 2. Biological 
observations reveal 
workload inequality 
and reversal behaviors 
in ants. (A) Experimen- 
tal apparatus to track 
ant excavation; the inner 
diameter of the con- 
tainer is 5.21 cm. (B) The Cc 
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tation” map derived from 
experimental data. Each E 
point in the map indi- 
cates the presence of a 
particular ant (out of 
30 ants), ordered from 
most active to least 
active (y axis) in the 
tunnel at a time t [soil 
moisture content (W) of 
0.1]. (D) Lorenz curves 
for workload distribu- 
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0.25-mm-diameter glass particles with soil moisture content of 0.1 (blue) and 0.01 (red) and a CA 
model (green) whose excavation rate was optimized with a GA. Shaded areas correspond to 
standard deviation from three experiments. cum., cumulative. The inset shows average Lorenz 
curves + SD/2 for a workload distribution within the group before (control, purple) and after 
(removal, blue) the most active diggers are removed from the group. Error bars correspond to 
standard deviations from three experiments. (E) Illustration of observed reversal behavior. 

(1) Ant Y’s path to excavate is blocked by ant Z. (II) After Z collects a pellet, it reverses, (Ill) forcing Y 
to reverse without excavating. (F) Total number of reversal events versus total ant visitors for the 
first 3 hours of ant excavation (soil moisture content of 0.1). Each data point represents total reversal 
events and total entries counted for 30-min segments collected from three experiments. Linear fit 
(blue line) with coefficient of determination (R2) = 0.69. 


below), we included these “reversals” in the Lorenz 
curve calculations because these animals expended 
energy in a trip to the tunnel face and contributed 
to tunnel traffic. 

To characterize the Lorenz distributions, we 
calculated the Gini coefficient, G, defined as 
the ratio of the area between the Lorenz curve 
and the line of equality to the area under the 
line of equality (15). G is a measure of the de- 
viation of the workload from perfectly shared 
(G = 0, all workers work equally) to completely 
unshared (G = 1, a single worker performs all 
work). Lorenz curves were characterized by 
G = 0.75 + 0.10 and displayed similar func- 
tional forms across a variety of experimental 
conditions (see Fig. 2D and fig. $2). 

In the presence of competing tasks, like 
foraging or brood care, task allocation in ants 
can change depending on colony needs (16). To 
investigate temporal variation in ant excavation 
workload, we divided 48-hour experiments into 
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12-hour “epochs” (time periods). Although indi- 
vidual activity varied among epochs (Fig. 2C 
and fig. S3), the cumulative workload distri- 
bution was independent of epoch [one-way 
analysis of variance (ANOVA) F329 = 0.85, P = 
0.48] and soil moisture content (one-way ANOVA, 
Figg = 2.54, P = 0.13) (Fig. 2D, figs. S1 and $2, and 
table S1). Furthermore, when the most active excava- 
tors were removed from the group, remaining 
workers increased their activity and compensated 
for the loss, preserving the shape of the Lorenz 
curve and therefore producing similar Gini co- 
efficients (one-way ANOVA, F, 4 = 1.13, P = 0.35) 
(Fig. 2D, inset; table S2; and supplementary 
materials). Thus, given the consistency of the 
workload distribution, we hypothesize that var- 
jations in idleness (low activity levels) within a 
population may play an adaptive role in mod- 
ulating the crowded conditions of confined 
tunnels and could have been important in the 
earliest social insect colonies (17). 
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Reversal behaviors were characterized by ants 
entering the tunnel and returning to the exit with- 
out carrying soil pellets. During the first 3 hours of 
the experiments, reversals occurred for 26 + 13% 
of trips for soil moisture content of 0.01 and 18 + 
3% of trips for soil moisture content of 0.1. These 
events were often associated with local crowd- 
ing at the excavation face (Fig. 2E) (16 + 12% of 
trips for soil moisture content of 0.01 and 10 + 
2% of observations for soil moisture content of 
0.1). Reversal behaviors in crowded conditions 
occur on foraging trails (78), and similar phenome- 
na have been observed in swarming bacteria (5). 
The incidence of this seemingly unproductive 
behavior increased with increasing overall ac- 
tivity of ants (Fig. 2F), suggesting that this be- 
havior serves as a feedback mechanism for 
mitigating clogs during excavation. 

To systematically examine the effects of idle- 
ness and individual retreating behaviors on ex- 
cavation performance, we developed a cellular 
automata (CA) excavation model (Fig. 3A and 
supplementary materials). Such models are use- 
ful in elucidating the dynamics of biological and 
vehicular traffic (9, 19). The model consists of a 
lattice (the “tunnel”) with a width of two cells 
[similar to S. invicta tunnel widths (20)] occu- 
pied by soil, empty space, an ascending CA “ant,” 
and/or a descending CA ant (Fig. 3A). The CA ants 
can move, change directions, excavate, deposit a 
pellet, or rest. As in the biological experiments, 
activity for the workload distribution in the CA 
model was measured by counting instances 
when CA ants visited the tunnel within three 
body lengths (cells) of the excavation site. 

We simulated the behavior of CA ants using 
both equal workload distributions (which we 
refer to as “active” CA ants) and unequal work- 
load distributions (which we refer to as “Lorenz” 
CA ants) with identical reversal probabilities 
(movie S2). In unequal workload distributions, 
individual CA ants were assigned individual 
“entrance probabilities” defined as the prob- 
ability that a CA ant will enter the tunnel. The 
initial entrance probability distribution for the 
30 CA ants was taken from the biological dis- 
tribution. Output workload distributions of CA 
simulations closely matched the input entrance 
probability distributions (as measured by the 
Gini coefficient, fig. S25). During a time-step, if 
its path toward the excavation area was blocked, 
a CA ant would reverse direction toward the exit 
with a probability, R, of 0.34 (supplementary 
materials); R was set by the proportion of total 
reversal events observed for 0.01 soil moisture 
in the biological experiments. 

The CA model that used unequal workload 
distribution and reversals reproduced experi- 
mentally observed biological ant digging rates 
(Fig. 2B). To determine if these rates represented 
an optimal workload distribution, we used a ge- 
netic algorithm (GA) (fig. $24) to select for 
entrance probability distributions (supplemen- 
tary materials) that maximized excavated tun- 
nel length within a given duration. Regardless 
of the initial population distribution (either 
similar to the ants or highly unequal), within a 


2 of 6 


810z ‘9} IsnBny uo /610'beweousalos’eoue!0s//:dy]y Wo. pepeojuMOGg 


RESEARCH | REPORT 


B 1 
Soil Tunnel 
) 
5 
2o6 
F=0 Ants eS 
: Misew s : 
T <i O = 
BL = 5 
cm fy 


0 Cum. fraction of workers 1 


T=1 
c 0 Generation 50 
30 T T r 
a 
a | 
20 Unequal 
£ 
2 Equal 
o 
+10 
D W,=2 BW 
= W,=3 BW 
= W;=4 BW 
°9 0.5 0.5 1 
Gini Coefficient, G Reversal probability 
n 
E 100 
50 
0.08 7 30 
= 25 
=. 
= | 20 
a oO 18 
S 15 
E 0.04 Oh ; iF 
[= a 8 
O 1 5 
oOo 3 
0 L 2 
0 1 2 3 4 


X (ants/W,) 


Fig. 3. Models reveal optimized traffic flow in narrow tunnels by means of selective retreating 
and workload inequality. (A) Schematic showing the main components of the CA model. Cell 
colors denote soil (light gray), tunnel (white), ants moving toward the excavation site (orange), and 
ants exiting the tunnel (dark gray). 7, simulation time-step. (B) Gini coefficient over time under GA 
optimization for groups started with a completely equal (purple), completely unequal (blue), and 
random (green) workload distribution. Lorenz curves (inset) for groups that begin with complete 
equality or inequality rapidly reach a similar workload distribution. (©) Excavated tunnel length, L+, 
after 24-hour simulation time versus Gini coefficient for tunnels of different widths, Wy, for a 30-—CA 
ant population. BL, body length; BW, body width. (D) Excavated tunnel length after 24-hour 
simulation time versus reversal probability for equal and unequal (optimized for 30 CA ants) 
workload distributions. (E) Simulated traffic flow (g, number of ants divided by time in seconds times 
tunnel width) versus CA ant occupancy (A, number of ants divided by tunnel width, measured in 
excavator body widths) for groups of equally (squares) and unequally (circles) active ants. Color 
bar indicates the size, n, of the excavating group. The theoretical fundamental diagram of the OAT 
model (yellow curve) illustrates the need to limit tunnel traffic to one worker per body width of tunnel 
width to optimize flow and prevent deleterious clogs. Experimental ant observations reveal an 
average occurrence around this density (orange-shaded region, where the orange centerline is the 
mean and the extents are one standard deviation away from the mean). 


few generations, the GA simulation converged 
to an unequal workload distribution (Fig. 3B, 
for a 30-ant example), which was similar to the 
experimentally observed biological workload dis- 
tributions (Fig. 2D, green). 
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The CA model also revealed the importance of 
the reversal behavior in conjunction with unequal 
workload distributions. Although the active exca- 
vation could be improved by sufficient reversal 
probability, only a small amount of reversal was 
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needed to increase the excavation performance 
in the unequal distribution (Fig. 3D). Thus, in 
addition to the benefits narrow tunnels provide 
for climbing and pellet transport (12, 13), we hy- 
pothesize that the ants benefit from narrow tunnels 
by expending less energy to dig wider tunnels to 
the same depth. Such benefits would be useful in 
the early stages of new nest construction (e.g., 
after the colony is flooded out) during which 
establishing the colony underground is critical. 

To gain insight into other benefits and con- 
straints set by such narrow tunnels, we simulated 
30 CA ants with varied workload distributions 
(characterized by distinct Gini coefficients) in 
tunnels of different widths. These distributions 
were created through a randomized Monte Carlo 
process, such that the Lorenz curves resulted in 
desired Gini coefficients. A peak in excavated 
length, L, versus Gini coefficient was observed 
in a tunnel two cells wide (Fig. 3C). Wider tun- 
nels (three and four cells wide) resulted in 
broader performance peaks, indicating a de- 
creased sensitivity in performance owing to 
workload distribution. This indicates that use 
of a narrow tunnel necessitates the “discovery” 
of the unequal workload distribution of ants. 

We hypothesized that the unequal workload 
distribution and reversals were linked to uniform 
flow of CA ants in the tunnel. We therefore 
measured the average flow rate of successful 
excavators, g, versus the average tunnel-width- 
normalized occupancy of excavators, i (the ratio 
of average number of ants in the tunnel to tun- 
nel width measured in ant body widths). To gen- 
erate a wide range of average occupancies, we 
varied the population size of the CA system. 

The flow rate was optimal at an intermediate 
occupancy (Fig. 3E). This nonmonotonic trend 
in g versus A is characteristic of various multi- 
agent systems, including bridge-building army 
ants (27) and vehicle traffic (22, 23), and is 
referred to in traffic literature as the “funda- 
mental diagram” (24). Active ants, which do not 
modulate their workload distribution, increase 
tunnel occupancy with increasing population 
and thus exhibit optimal flow rates for only a 
few population sizes. By contrast, GA-optimized 
Lorenz ants produced tunnel occupancies in the 
ideal range by generating increasingly unequal 
workload distributions for increasing CA ant 
population sizes. Of particular importance, fire 
ants produced tunnel densities in the ideal range 
(Fig. 3E, orange-shaded region). 

The ability of the ants to operate at the op- 
timum in the fundamental diagram and the 
rapidity by which the GA model converges 
(Fig. 3B) indicate the existence of a simple gov- 
erning principle for traffic control in confined 
task-oriented systems. To elucidate this principle, 
we formulated a minimal model of ant traffic in 
the narrowest (single-lane) tunnel: the one-at- 
a-time (OAT) model. This model, which builds 
on recent work on traffic of motor proteins on 
microtubules (25), allows us to estimate ana- 
lytically how the excavation rate varies with the 
rate of ants entering the tunnel (supplementary 
materials) for various work-distribution strategies. 
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In the OAT model, ants enter the tunnel and 
move toward its face; descending ants reverse 
direction if they either reach the end of the tun- 
nel or collide with an ant moving in the other 
direction (supplementary materials). We ini- 
tially modulated occupancy by varying the en- 
trance probability of all ants equally; as in the 
CA model, the flow rate of the OAT model was 
optimal at an intermediate width-normalized 
ant occupancy (Fig. 3E, yellow curve, and sup- 
plementary materials)—in particular, one exca- 
vator for every excavator that can fit along the 
width of the tunnel. Although the peak in the 
fundamental diagram has been associated with 
the transition between steady flow and prop- 
agating traffic jams (24), the OAT model high- 
lights a key feature of confined tunnel excavation: 
Traffic dynamics are driven by tunnel width. 
Given the task-oriented nature of the system, 
successful traffic flow is only possible if a worker 
can travel the entire length of the tunnel and 
back. Thus, if there are enough workers in the 
tunnel to clog the path to or from the excavation 
site, traffic is likely to slow down. The OAT 
model highlights this scenario, as such clogs 
are unavoidable if more than one ant is in the 
single-lane tunnel. Because ants cannot pass each 
other or change lanes, only the first ant to enter 
can reach the end to excavate, whereas other 
workers collide with the first worker, reverse, 
and impede traffic. 

Mechanisms that target a specific number 
of excavators occupying the tunnel given the 
tunnel’s width promote ideal traffic flow. When 
individual ants in the OAT model were pro- 
grammed to modulate their rate of reentry 
according to how often they reversed without 
excavating, the OAT model rapidly converged 
to Lorenz curves similar to the biological and 
GA-optimized CA ants (supplementary materials 
and fig. S23). Such rapid convergence highlights 
the benefit of targeting a specific number of ants 
(in this case, by establishing unequal workload 
distributions) in narrow tunnels. 

We next used a system of excavating robots 
(fig. S9) to test if the above theoretical strategies 
could improve traffic in confined experimental 
situations with more complex, unpredictable 
interactions. Because, presently, robot mobility 
in real-world environments is poor relative to 
biological systems and because real collisional 
interactions not modeled in CA and OAT are 
typically neglected in swarming robot studies 
(2), such robophysical (26) studies can aid ro- 
bot design and control for real-world robot 
swarms, as well as suggest hypotheses for 
studies of ant traffic (J8), adaptive behaviors, 
and morphological features for crowded exca- 
vation and movement. 

Groups of roughly elliptical robots (movie S3) 
with similar aspect ratios to the biological ants 
were tasked with excavating a model cohesive 
granular medium of hollow plastic spheres con- 
taining loose magnets; this design allows clumps 
of media to be formed, analogous to the pellets 
of cohesive soil formed by the biological ants 
(13). Our robots followed simple instructions 
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triggered by onboard sensory feedback of the 
surrounding environment (supplementary ma- 
terials). Previous work in swarm robotics (27) 
used similar decentralized strategies in conjunc- 
tion with collision-avoidance schemes (2, 28) 
to produce emergent flocking behavior. By con- 


trast, our robots detected collisions with push 
switches on their outer shell, which triggered 
navigation strategies such as steering away and 
readjusting to promote clog resolution (movie S4). 

To challenge the robots, we constructed a 
tunnel (Fig. 4A) with a width of three robot 
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Fig. 4. Traffic flow and local dynamics during robot excavation. (A) Schematic of the 
excavation arena indicating the tunnel length, L; (excluding the excavation area); robot width, 
We; robot length, Ly; and tunnel width, Wy. A pink centerline along the tunnel was monitored by 
the robots’ onboard cameras, enabling them to follow the tunnel path. (B to D) Experimental 
space-time overlap heat maps of robot positions (x axis) for four-robot trials of (B) active 
digging, (C) Lorenz digging, and (D) reversal digging. Color indicates the number of robots 
occupying a particular space and time: one (purple), two (orange), three (yellow), and four 
(white) robots. Histograms above the graphs show the frequency of occurrence of clusters with 
two or more robots at different lateral positions. (E) Average flow rate, g, + SD measured in 
deposits per minute versus number of robots in the experiment, N, for active (green), Lorenz 
(light blue), and reversal (maroon) strategies. (F) Illustration of various collision scenarios 
encountered by robots owing to movement toward guide trail (top), turning (middle), and 
forward-backward translation (bottom). Orange starbursts indicate collisions. (G to 1) Relaxation 
times for all strategies: active (green), Lorenz (light blue), and reversal (maroon). (G) Relaxation 
time versus cluster size, N., for three-robot (solid) and four-robot (dashed) trials. The inset shows 
sample average correlation curves, Q(1), that measure how N, = 1 (purple), 2 (orange), 3 (yellow), 
and 4 (white) robot clusters dissolve over time during four-robot reversal trials; shaded region 
indicates average curves + SD/2 (standard deviations for T* range from 100 to 500 s). 

(H) Relaxation times versus linear aggregation density, A¢, for four-robot trials and (1) 
corresponding number of cluster occurrences, /., versus linear aggregation density. 
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widths (or 1.5 robot lengths), which, combined 
with the oblong robot shape, forced a challenge 
of turning around in confined spaces. We tracked 
the positions (supplementary materials and 
fig. S10) of the robots in the main tunnel area 
(i.e., excluding the excavation site) to generate 
space-time overlap maps of robot positions (see 
Fig. 4, B to D), which give visual insight into robot 
flow during excavation. 

We first examined systematically how exca- 
vation performance changed as numbers of 
robots increased for our active protocol (as in 
Fig. 1C and fig. S6), which assigned equal work 
“desire” to all diggers: After soil deposition, each 
robot immediately returned to the tunnel to ex- 
cavate. Despite constraints on maneuverability, 
sensing, and morphology, the robophysical ex- 
periments demonstrated qualitatively similar per- 
formance to the ants and the computational 
and theoretical models. For example, measure- 
ment of the average flow rate, g, of successful 
excavators (which we quantify here as the num- 
ber of deposits per minute) revealed that excava- 
tion performance increased with an increasing 
number of robots in the trial (V) until the sys- 
tem became sufficiently crowded (Fig. 4E). 

To characterize how clustering led to per- 
formance degradation in the active protocol, 
we measured the frequency of cluster occur- 
rences, denoted /,. Here we defined clusters as 
groups of robots of number NV, whose center 
positions were within a robot length of each 
other (supplementary materials). Such clusters 
occurred most frequently at the excavation site 
(histograms in Fig. 4, B to D), yielding phase 
separation (29) in the system, whereby a portion 
of robots were jammed at high density, whereas 
others moved smoothly through the tunnel at 
low density. 

As in (9), we also measured the character- 
istic “relaxation” times for clusters using a tool 
from the study of glassy systems, the density over- 
lap correlation function Q(t). Q(t) compares the 
spatial overlap of a cluster at a specific time 
to the overlap of the cluster’s original lateral 
segment at a later time, t. Assuming a one- 
dimensional tunnel, we calculated the spatial 
overlap of robots by tracking their centroid po- 
sition laterally (along the tunnel) and assigning 
intensity potentials in space, summing over- 
lapping potentials of adjacent robots (fig. S12A). 
From these curves (Fig. 4G, inset; fig. S12; and 
supplementary materials), we calculated the 
relaxation time, t*, for clusters of different 
N., by fitting a stretched exponential function, 


Q(t) = exp{ —(3)', to a Q curve averaged 


over clusters of the same N,, where f is a fitting 
parameter that is of order unity. 

The relaxation time analysis highlighted how 
sufficient numbers of active robots (N = 4) 
resulted in clustering cascades. For example, 
N. = 2 robot clusters could be sufficiently dif- 
ficult to resolve before a third robot joined the 
cluster, which in turn led to catastrophic N, = 
4 robot jams that spanned the tunnel width. 
Such clogs were then difficult to resolve with the 
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robots’ limited sensory and motor capabilities 
and were likely exacerbated by the robots’ rigid 
oblong shape (Fig. 4F). A sharp increase in t* 
for clusters with a linear density, Ac = Ne/Le, 
where L, is cluster length in body lengths, greater 
than unity (multirobot clusters) during four-robot 
active trials (Fig. 4H, green curve) revealed how 
this cascading scenario is reminiscent of glassy 
arrest in particulate systems (30, 37). 

To discover how the strategies of idleness 
distributions and reversals affected clustering 
and traffic dynamics in the robots (movie S5), 
we implemented two protocols inspired by the 
biological observations and theoretical models. 
As in the CA model, in the Lorenz protocol (fig. $8), 
we implemented an unequal probability to 
enter the tunnel derived from experimental ant 
workload inequalities. We also implemented a 
separate robot reversal protocol (fig. $7), which 
produced selective retreats, whereby the robots 
were programmed to immediately resume exca- 
vation after deposition but reversed after not 
successfully reaching the excavation site within 
a given time. These strategies led to different 
excavation performances as N increased; but 
most importantly, both strategies outperformed 
the active protocol at N = 4 (Fig. 4E). 

The relaxation times and cluster analysis re- 
vealed the mechanisms by which the different 
protocols mitigated clogging, particularly in the 
distinct ways in which they reduced the dura- 
tion of clusters and thus optimized the average 
occupancy of excavators, thereby improving traf- 
fic flow. For trials with up to three robots, all 
strategies produced a relatively low 1* (Fig. 4G) 
and frequency of cluster occurrence regardless 
of the number of robots in a cluster. However, 
for N = 4, the Lorenz and reversal protocols 
mitigated the clogging effects associated with the 
aggressive excavation in the active protocol. 

The Lorenz and reversal protocols provided 
distinct forms of mitigating the catastrophic 
cascades of clogs found in the active protocol: 
Unequal workload distributions reduce the oc- 
currence of clusters, and selective retreating 
limits the duration of clogs. Selective retreating 
in the reversal strategy limited the duration of 
clogs. Thus, instead of the glass-forming char- 
acteristics of active robots, clusters dissolved 
after some time, yielding low t* (Fig. 4, G and 
H). The unequal workload distributions of the 
Lorenz strategy reduced the occurrence of clus- 
ters, especially the highest-density four-robot 
clusters (Fig. 41), where glasslike clog forma- 
tion is most likely to occur, resulting in fewer 
catastrophic clogs at the excavation site. We 
found similar evidence for clog mitigation in 
the analysis of clusters in the CA model (fig. S26 
and supplementary materials), whereby clog miti- 
gation was further found to be most effective 
when both strategies (reversals and unequal en- 
trance probabilities) were used in combination. 

To close, we return to the traffic aspects of 
the confined system: As in theory, traffic flow 
of robotic ants (which dominates the excava- 
tion performance) was maximal at an intermediate 
occupancy of excavators, 1 = Ny /Wr, where Nr is 
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the number of robots in the main tunnel area 
averaged across all frames of video and W, is the 
width of the tunnel, followed by a gradual de- 
cline at higher A (fig. SILA). However, unlike the 
theoretical models, peak flow rate in robotic sys- 
tems occurred at a of approximately 0.25, which 
corresponds to less than one robot traversing 
the tunnel at a time, despite a tunnel width of 
about three robot widths, or 1.5 robot lengths. 

We hypothesize that the underperformance 
of our robots relative to the biological and the- 
oretical systems is a consequence of our robots’ 
limited mobility in confined spaces, indicating 
that deformable bodies (32) and novel locomo- 
tor mechanisms (72) will be important in con- 
fined real-world robot collectives. That said, 
given these strategies are robust to the vagaries 
of real-world interactions, we posit that other 
engineered systems—including robot swarms in 
disaster rubble, nanorobots surging through the 
bloodstream (33), and task-capable active ma- 
terials (3)—could benefit from simple strategies 
that involve labor inequality, particularly in cre- 
ative combinations (34). 
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HIGH-PRESSURE PHYSICS 


Insulator-metal transition in dense 


fluid deuterium 


Peter M. Celliers’*, Marius Millot’, Stephanie Brygoo”, R. Stewart McWilliams’, 
Dayne E. Fratanduono’, J. Ryan Rygg’*, Alexander F. Goncharov’, Paul Loubeyre”, 
Jon H. Eggert’, J. Luc Peterson’, Nathan B. Meezan’, Sebastien Le Pape’, 

Gilbert W. Collins’*, Raymond Jeanloz®, Russell J. Hemley’ 


Dense fluid metallic hydrogen occupies the interiors of Jupiter, Saturn, and many 
extrasolar planets, where pressures reach millions of atmospheres. Planetary structure 
models must describe accurately the transition from the outer molecular envelopes to the 
interior metallic regions. We report optical measurements of dynamically compressed 
fluid deuterium to 600 gigapascals (GPa) that reveal an increasing refractive index, the 
onset of absorption of visible light near 150 GPa, and a transition to metal-like reflectivity 
(exceeding 30%) near 200 GPa, all at temperatures below 2000 kelvin. Our measurements 
and analysis address existing discrepancies between static and dynamic experiments for 
the insulator-metal transition in dense fluid hydrogen isotopes. They also provide new 
benchmarks for the theoretical calculations used to construct planetary models. 


he transformation of hydrogen from a mo- 

lecular insulator to an atomic metal at high 

densities has been a longstanding focus 

in physics and planetary science (1). The 

unique quantum metallic properties of the 
low-temperature solid (i.e., below 300 K) have 
drawn sustained interest (2-5), and character- 
izing the transformation in the hot, dense fluid 
is crucial for understanding the internal struc- 
ture and dynamics of giant planets (6), includ- 
ing the origin of their large magnetic fields (7). 
Numerous studies of the insulator-metal (IM) 
transition in dense fluid hydrogen, beginning 
with theoretical work five decades ago, predicted 
a first-order transition in the fluid (8-10) with a 
critical point at very high temperatures (~13,000 
to 15,000 K) and 60 to 90 GPa. However, the first 
experimental work on the IM transition in the 
fluid, carried out using dynamic compression 
techniques, provided evidence for a continuous 
transition with metallic states reached in the 
pressure range P = 50 to 140 GPa and temper- 
atures T = 3000 to 8000 K (JJ-13). More recent 
predictions (14-17) placed the critical point at a 
much lower temperature (~2000 K). This moti- 
vated several experimental studies using static 
diamond anvil cell (DAC) techniques (J8-22) 
and dynamic compression (23) to probe the fluid 
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properties below 2000 K and up to several hun- 
dred GPa. 

Dynamic compression can explore a broad 
range of thermodynamic paths with time-varying 
manipulations of the applied pressure and con- 
trolled reverberation of pressure waves through 
the sample. This includes probing the dense fluid 
at temperatures below 2000 K, for example, with 
an initial jump in pressure delivered by a shock 
wave followed by shock reverberation or gradual 
ramp compression. The first demonstration of 
this strategy was carried out on deuterium with a 
magnetic compression technique at the Z facil- 
ity (23). The results showed strong optical ab- 
sorption beginning in the range 100 GPa < P< 
130 GPa, followed by weak fluctuating reflec- 
tance in the range 130 GPa < P < 300 GPa, and 
culminated in abrupt jumps to high reflectance 
near 300 GPa. Knudson et al. (23) attributed the 
absorption to band gap closure and determined 
that the reflectance jumps were associated with 
the first-order IM transition. The reflectance jumps 
occurred at higher pressures upon compression 
than upon decompression, plausibly as a result of 
thermal conduction. Meanwhile, improvements 
in static compression methods have allowed the 
exploration of the behavior of the fluid over part 
of this pressure-temperature (P-7) range (up to 
170 GPa and >1800 K) (18-22, 24-26). Changes 
in optical properties from 120 to 170 GPa de- 
pending on temperature were attributed to 
the IM transition (J8-20, 22), whereas other 
experiments suggest the persistence of a finite 
(~1 eV) band gap at similar conditions (27). 

The IM transition is the subject of a number 
of continuing theoretical studies (14-17, 27-29) 
that consistently predict a discontinuous transi- 
tion below a critical point near ~2000 K, but over 
a broad range of pressures. Density functional 
theory (DFT)-based calculations show a spread 
in the transition pressure spanning 150 GPa, 
arising from the sensitivity of the boundary to 
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the choice of exchange-correlation functional 
used and whether zero-point energy is accounted 
for (/, 16). Quantum Monte Carlo (QMC) calcu- 
lations should provide improved bounds on the 
transition pressures (16, 17), although they dis- 
agree with a recent benchmarking experiment 
(30). Transition pressures for hydrogen and deu- 
terium are expected to be different because of 
isotope effects, but with a small relative magni- 
tude. The transition in deuterium from QMC 
simulations is 30 GPa higher than in hydrogen 
at 600 K, decreasing to 10 GPa higher at 1200 K 
(16). Despite experimental support for a first- 
order IM transition (19, 20, 22, 23), the critical 
point has not been experimentally identified. 
Furthermore, the broad discrepancies in the mea- 
sured transition pressure (20, 22, 23) and character 
(20-23) have made resolving the differences be- 
tween the theoretical models challenging. 

We completed a series of five dynamic com- 
pression experiments at the National Ignition 
Facility (NIF) to probe the IM transition up to 
600 GPa at temperatures ranging from 900 K to 
1600 K. The experiments were carried out using 
168 laser beams to deliver up to 300 kJ of ultra- 
violet light that drove a near-isentropic reverber- 
ation compression of a cryogenic liquid deuterium 
sample. We adjusted the time dependence of the 
laser delivery (pulse shape) to control the com- 
pression sequence imposed on the sample as a 
function of time. Line-imaging Doppler veloc- 
imetry recorded both the compression history 
and the evolution of the optical properties of the 
D, sample during the nanosecond compression 
process, using a probe laser operating at 660 nm. 

In our experimental setup, the fluid deuterium 
sample is sandwiched between a copper piston 
and a LiF window and is viewed through the 
window by the diagnostic (Fig. 1). The upper 
half of the view shows signal from the light re- 
flected at the D2-LiF interface where a 100-nm- 
thick aluminum film was deposited (position 
>0O um in top panels of Fig. 1, B and C). The lower 
half shows the signal from light reflected initial- 
ly at the piston surface (double-passed through 
the transparent sample layer). Because the two 
reflecting interfaces move differently, the two 
halves of the field of view display different ap- 
parent velocities (different fringe phases) until 
the time when the light intensity on the sample 
side reaches a minimum. At later times, the ap- 
parent velocity on the sample side matches that 
of the AIl-LiF interface (common fringe phase), 
indicating that the optical reflection has shifted 
from the piston surface to the D,-LiF interface. 

The copper piston (lower half) started moving 
near ¢ = 10 ns when an initial weak shock was 
transmitted into the sample layer. We controlled 
the first shock strength for each experiment in 
order to follow different isentropic compres- 
sion paths. The first shock strengths varied from 
1.8 GPa to 3.4 GPa (table S1). After the first shock 
crossed the sample layer, it reverberated against 
the LiF window (upper half, 16 ns) and continued 
to reverberate between the piston and the win- 
dow while the sample layer was compressed. 
After several reverberations, both the piston and 
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window interface approached a common appar- 
ent velocity by 27 ns. At this point, the sample 
was compressed to conditions estimated to be 
near density p = 0.8 g/cm’, P = 20 to 30 GPa, 
and T = 600 to 900 K and was observed to be 
fully transparent. A second pressure wave arrived 
at the sample layer at 30 ns and initiated a sec- 


ond set of reverberations, further pressurizing 
the sample to nearly 600 GPa in four experi- 
ments and to 215 GPa in one experiment. The 
duration of this second set of reverberations 
was shorter than the first, accomplishing much 
of the pressure increase in less than 3 ns. The 
transition from optically transparent to strongly 


reflecting occurred during the second set of re- 
verberations and is attributed to the IM tran- 
sition in deuterium. 

The thermodynamic path of the sample was 
inferred from hydrodynamic simulations con- 
strained by the measured interface velocities 
(31, similar to previous work (23). Different 
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Fig. 1. Schematic of target and experiment together with raw data. 
(A) A planar sample layer of liquid deuterium, 31 wm thick, contained 
between a 70-um-thick copper piston and a 500-um-thick LiF window, 
is mounted onto the side of a hohlraum (brown in bottom sketch), driven 
with the NIF laser (blue arrows), and diagnosed with a line-imaging Doppler 
velocity interferometer (Sensor positioned on right). (B) Example data 
record from experiment N150914-2. Fringe amplitude encodes reflectivity 
and fringe phase is proportional to the Doppler shift of the reflected light 
(piston and window interface velocities); the frame below shows apparent 
velocity signals extracted from the fringe phase and corresponding 
simulated velocities. At times 30 < t < 31 ns, simulations (black 
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dashed curve) with the extrapolated Dewaele et al. (36) refractive index 
fit do not match the observations and require a corrected index (green 
dashed curve) to match observations; additional frames below show 
pressure, temperature, and density at a point in the center of the 
sample estimated from simulations matched to the velocity data 
based on three different EOS models for deuterium. (©) Details of the 
information presented in (B) with magnified time axis. Vertical gray 
lines denote the transition from partially transparent to opaque; vertical 
magenta lines indicate the time when the reflectivity at the D/LIF 
interface exceeds 30%. The normalized reflectance shows independent 
measurements from the two detectors. 
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equation of state (EOS) models for deuterium 
(32-34) resulted in calculated pressures that 
are nearly the same to within 1% independent 
of the EOS model. We estimate that the pres- 


sure along the compression path is accurate to 
+3 GPa (measurement error of the velocities), 
enabling determination of the metallization 
pressure to <3% accuracy (after accounting for 
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Fig. 2. Optical signatures of the insulator-metal transition. Curves in all frames are matched 

for the five experiments following isentropes that correspond to the colored P-T bands in Fig. 3. 

(A) Reflectivity at the D2-LiF interface as a function of pressure, after the D2 layer becomes opaque. 
Inset: Raw data from experiment N171105-3, showing a sharp reflectivity jump just before t = 32 ns. 
(B) Absorption coefficient; legend indicates experiment number and T along isentrope near P = 200 GPa. 
Colors and line types apply to all panels. (C) Index of refraction from simulations matched to the 
observed Doppler shifts and, for comparison, extrapolation of the Dewaele et al. (36) refractive index fit 
to higher density (solid gray curve). (D) For P < 150 GPa, optical (AC) conductivity as extracted by 
combining the data in (B) and (C) (symbols with colors matched to curves) and corresponding Lorentz 
model fits (curves); for P > 150 GPa, the DC conductivity inferred from the reflectivity data in (A) and the 
Smith-Drude model with t = 0.075 fs (31). Error bars in (A) to (C) represent SD of multiple 
measurements collected in a set of velocity bins and mapped to the pressure scale. 
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measurement error in the reflectance). The EOS 
models compare well with recent benchmark- 
ing experiments (13, 30, 35). However, none of 
the current EOS models represent the IM tran- 
sition as a first-order phase transition (i.e., with 
a density jump and latent heat), and the range 
of temperature predictions among EOS models 
is large, leading to uncertainties of as much 
as +260 K near the IM transition. Density dif- 
fers by less than +6% among the models. 

Several optical signatures are associated with 
the transition from transparent to reflecting 
(Fig. 2). While the sample layer remained par- 
tially transparent, we determined the absorp- 
tion coefficient up to a magnitude of about 
1um (Fig. 2B), when the optical depth of the 
sample layer exceeded ~2 and the layer became 
opaque. When the sample was opaque, light con- 
tinued to be returned from the target because 
of the reflection from the D.-LiF interface, with 
reflectance near 10%. While the sample was 
still partially transparent, we observed changes 
in the real part of the index of refraction that 
are consistent with band gap closure. We ex- 
tracted the real part of the index of refraction 
(Fig. 2C) by fitting to the apparent velocity of 
the piston (observed through the deuterium 
sample) self-consistently with our hydrodynamic 
simulation (37) (Fig. 1). The real part of the index 
agrees closely with that of insulating hydrogen 
at lower pressure (36) but deviates from the low- 
pressure linear density dependence as the band 
gap decreases. 

The gap energy, which we estimated from the 
refractive index (31), approaches ~2 eV when the 
sample layer becomes opaque, consistent with 
the probe photon energy of 1.9 eV. Extrapolation 
(linear in density) suggests that for the fluid 
deuterium states achieved in our study, the band 
gap closes in the range 200 GPa < P < 250 GPa 
(31) depending on the experiment (fig. S22B). 
The ~10% reflectance we observed when the 
sample becomes opaque (Fig. 2A) is consistent 
with the expected reflectivity of the pressurized 
interface between materials of different indices 
of refraction, 2p, ~ 2.8 and nyjz ~ 1.5, and rep- 
resents a minimum reflectivity of insulating 
deuterium in this experiment. Further pressur- 
ization reveals a rapid increase in reflectivity at 
the D,-LiF interface eventually reaching a satu- 
ration level near 55%. Reflectivity of ~30% at the 
D.-LiF interface corresponds to the minimum 
metallic electrical conductivity opc = 2000 S/em 
(11, 37); values exceeding this indicate metallic 
behavior (37). From this measurement, we in- 
ferred that the IM transition occurs in the pres- 
sure range between approximately 150 GPa and 
250 GPa. Although we did not observe a dis- 
continuous change in reflectivity through this 
range for three of our experiments, finite tem- 
poral resolution in the measurements may have 
obscured a sharp transition; one experiment 
(Fig. 2A, magenta dashed curves and inset) re- 
corded at higher time resolution revealed a sharp 
reflectivity increase. In all cases, the reflectivity 
increase steepened noticeably near 200 GPa. 
Of the four experiments that reached 600 GPa 
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peak pressure, we recorded some of the reflectiv- 
ity data during pressure relaxation; unlike prior 
experiments (23), there is no evidence for different 
reflectivity signals during increasing (dP/dt > 0) 
and decreasing (dP/dt < 0) pressurization. 

We can fit our results to simple models of the 
evolution of the optical properties through the 
transparent, optically absorbing, and optically 
reflecting regimes. The absorption has a steep 
pressure dependence and increases with tem- 
perature (Fig. 2B), most likely as a result of 
disorder in the material. In the partially trans- 
parent regime, we extracted the optical conduc- 
tivity (Fig. 2D, symbols) directly from our data, 
using the expression o(@) = nac (Gaussian units) 
combining 7 (Fig. 2C) and the absorption co- 
efficient a (Fig. 2B); here, c is the speed of 
light. A Lorentz optical model incorporating a 
density-dependent oscillator frequency matches 
these data (Fig. 2D, curves below 150 GPa). We 
estimated the DC conductivity above the tran- 
sition (Fig. 2D, P > 150 GPa) with the Smith-Drude 
model (2/, 38) evaluated with a fixed relaxation 
time t = 0.075 fs and variable backscatter pa- 
rameter matched to the optical reflectivity (Fig. 
2A and fig. S25). The conductivities determined 
here exhibit trends similar to those calculated 
theoretically (14). 

We plotted the optical absorption and reflec- 
tivity signatures along the calculated temper- 
ature versus pressure (7-P) paths extracted from 
the simulation models (Fig. 3), referred to by 
their T near 200 GPa (table S1). Our observa- 
tions of the onset of optical absorption (Fig. 3, 
open circles) are in good agreement with the re- 
sults of McWilliams et al. (27) and Knudson e¢ ai. 
(23) (Fig. 3, black dashed line). These observa- 
tions, along with the extrapolated band gap en- 
ergy as a function of P (fig. S22B), imply that the 
band gap remains finite leading up to the IM 
transition. These conclusions differ from those of 
several DAC measurements performed at similar 
conditions (18-20, 22). The DAC measurements 
using laser heating produced plateaus in tem- 
perature with increasing laser power that were 
interpreted as a signature of the metallization 
transition (J8-20, 22). However, the heating 
plateaus in the static experiments are better 
correlated with the onset of absorption in our 
experiment rather than metallization, consist- 
ent with alternative interpretations of those data 
(21, 23, 39). The observation of absorption well 
below the point of band gap closure is also found 
in the solid at lower temperatures (40, 41). 

We found saturation of the reflectivity above 
~280 GPa in the four experiments carried out 
to the highest pressures. In these datasets, the 
transition temperature exhibited a clear trend 
decreasing with pressure by ~20 K/GPa. The 
lowest observation at T ~ 1100 K showed the 
transition occurring at 221 GPa. One experi- 
ment at even lower T ~ 940 K did not reveal 
reflectivity greater than 10% to the maximum 
P ~ 215 GPa, but does not rule out the possibility 
of a transition at higher P. Pulsed-heating DAC 
experiments on H, (20, 22) found reflectivity 
saturation at temperatures similar to our ex- 
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Fig. 3. Phase diagram of H2/D>2 at high P-T conditions. Present results: Threshold where 
optical absorption coefficient exceeds ~1 um7! at 660 nm and the band gap is ~2 eV (black open 
circles connected by thick dashed line); points where the D/LIF interface reflectivity exceeds 
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(16) and Knudson et al. (23) are shown, including the CEIMC calculation for deuterium (dashed 
green curve) and DFT calculations with quantum corrections for the ions [vdW-DF1 for deuterium 
(dot-dashed blue curve), vdW-DF2 for deuterium (dark blue curve), and PBE for hydrogen (dashed 


orange curve)]. 


periments and at pressures that were lower by 
~20 GPa, which may be consistent with the ex- 
pected isotope effect. 

The steep increase in reflectivity with pressure 
that we observe near 200 GPa contrasts with the 
interpretation of the ramp compression experi- 
ment on deuterium carried out at the Z facility 
(23). Although the onset of optical absorption is 
consistent with our work, the subsequent weak 
reflectivity followed by jumps near 300 GPa 
(attributed to the IM transition) occurred at 
higher pressure than observed here and in other 
later studies (20, 22), whereas the low (<2%) 
reflectivity of the D.-LiF interface recorded at 
intermediate conditions is not consistent with 
either insulating or semiconducting deuterium 
and LiF under pressure (31, 36, 42, 43). Our 
data indicate onset of the IM transition at P ~ 
200 GPa; the optical reflectivity saturates to a 
constant value of ~55% near 280 GPa and re- 
mains stable over hundreds of GPa, similar to 
the pressures where high reflectivity is displayed 
in the Z experiments. This shows that both ex- 
periments are probing similar states at onset 
and completion of the IM transition, but not at 
intermediate conditions. 
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We propose that the different results are re- 
lated to the time scale of the Z experiments being 
two orders of magnitude longer, allowing tur- 
bulent lateral flows within the sample layer to 
be established during the observations. These 
effects lead to a different interpretation of the 
reflectivity jumps observed in the Z data, cor- 
relating these with completion rather than onset 
of the IM transformation (37). This requires a 
correction of the inferred transition temperatures 
from the Z-machine to lower values (table $3), 
owing to the latent heat of the IM transition. 
Although there are uncertainties in the tem- 
perature estimates in both experiments, we ex- 
pect that those uncertainties are accentuated 
by the longer time scales, which give rise to 
multiple mechanisms of heat transport in the 
Z experiments. The much shorter time scales 
in our experiments preclude these from occur- 
ring, so that the temperature estimates for our 
experiments depend primarily on the EOS mod- 
els in the molecular fluid phase. 

We interpreted the onset of reflectivity above 
10% as the beginning of pressure-induced mo- 
lecular dissociation associated with the IM 
transition. The extent and P-T range of the IM 
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transition are in approximate agreement with 
the DFT-based molecular dynamics calculations 
using the van der Waals (vdW-DF1) exchange- 
correlation functional (44) reported in Knudson 
et al. (23). Our observations and analyses are 
also consistent with the coupled electron-ion 
Monte Carlo method (CEIMC) calculations of 
Pierleoni et al. (16) and with the quantum Monte 
Carlo calculations of Mazzola et al. (17). They are 
about 100 GPa higher than DFT calculations 
based on the Perdew-Burke-Ernzerhof (45) (PBE) 
exchange-correlation functional (15, 29) and about 
50 GPa lower than DFT calculations based on the 
van der Waals vdW-DF2 functional (23, 46). In the 
saturation regime, the estimated ope ~ 10* S/em 
is in good agreement with the recent estimates 
from lower-pressure experiments (22) and with 
calculations (14). 

Our interpretation of the Z experiments does 
not alter the conclusion that the abrupt reflec- 
tivity jumps observed in those experiments are 
evidence of a first-order IM transition. The 
revised 7 = 700 K is well below the predicted 
critical-point temperature Tcp, whereas one of 
our experiments provides evidence for first-order 
behavior up to 1100 K. The extrapolated band 
gap energies (linear in density; fig. S22B) reach 
closure at higher P than the corresponding ob- 
served transition P, indicating that the gap closes 
nonlinearly and could indicate first-order behav- 
ior at temperatures as high as 1640 K. Therefore, 
although 7¢p is not yet precisely identified, it is 
bounded in the range 3000 K > T¢p > 1100 K, as 
indicated by our data and by continuous trans- 
formations observed in prior experiments to 
higher temperature (J/). 

Our revision of the IM transition data from 
the Z experiments produces points that lie close 
to and even below previous melting-line studies 
on hydrogen (24, 26). A straight line connect- 
ing our data to the revised Z data has a slope of 
-5 K/GPa. This invites speculation as to whether 
a thermodynamic triple point with coexisting 
crystalline solid, insulating fluid, and liquid 
metal phases is located near 600 K and 300 GPa. 
If so, at higher pressures the insulating crystal- 
line solid would transform directly to the liquid 
metal. Because the melting line of deuterium 
has not been determined and because quantum 
effects are important, the existence and location 
of such a triple point remains an open question. 

Demixing in dense H-He fluid mixtures, an 
important process expected to occur inside giant 
planets (47, 48), is closely connected with the IM 
transition (49). Recent state-of-the-art calcula- 
tions of H-He demixing are all based on ab initio 
calculations with increasing levels of sophistica- 
tion (49-55). The two most recent calculations of 
H and He EOS for planetary models are based on 
DFT molecular dynamics (56, 57) with the PBE 
exchange-correlation functional, and until this 
year, ab initio-based demixing studies (53, 54) 
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were also based on PBE. The emerging theoret- 
ical (16, 17) and experimental (23) evidence that 
PBE underestimates the IM transition pressure 
motivated Schottler and Redmer (55) to examine 
H-He demixing with the vdaW-DF1 functional. 
They showed a clear shift of the demixing bound- 
ary to higher pressure and lower temperature 
relative to PBE-based calculations. Thus, al- 
though the first-order IM transition occurs at 
much lower temperature than the Jupiter and 
Saturn isentropes, it plays an important role as 
a benchmarking feature to validate the func- 
tional on which the H-He demixing calculations 
are based. Our data suggest that the vaW-DF1 
functional is the current best choice. 
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MEMBRANES 


3D printed polyamide membranes 


for desalination 


Magsud R. Chowdhury’, James Steffes”, Bryan D. Huey”, Jeffrey R. McCutcheon 


Polyamide thickness and roughness have been identified as critical properties that affect 
thin-film composite membrane performance for reverse osmosis. Conventional formation 
methodologies lack the ability to control these properties independently with high 
resolution or precision. An additive approach is presented that uses electrospraying to 
deposit monomers directly onto a substrate, where they react to form polyamide. The small 
droplet size coupled with low monomer concentrations result in polyamide films that are 
smoother and thinner than conventional polyamides, while the additive nature of the 
approach allows for control of thickness and roughness. Polyamide films are formed with a 
thickness that is controllable down to 4-nanometer increments and a roughness as low as 

2 nanometers while still exhibiting good permselectivity relative to a commercial 


benchmarking membrane. 


he thin-film composite (TFC) membrane 

has served as the desalination industry’s 

standard membrane for more than 30 years. 

During that time, this membrane has 

changed little. The composite structure com- 
prises a polyester backing layer for mechanical 
support, a porous supporting polysulfone mid- 
layer cast through phase inversion, and an ul- 
trathin, highly cross-linked polyamide film that 
is dense enough to separate salt ions from water 
but thin enough to have a low resistance to water 
transport. This polyamide layer is formed in situ 
onto the porous midlayer via interfacial polym- 
erization. This approach relies on a reaction be- 
tween an amine [m-phenylene diamine (MPD)] 
in an aqueous phase and an acid chloride 
[trimesoyl chloride (TMC)] in an organic phase. 
The immiscibility of the two phases permit the 
reaction to occur only at the phase boundary. 
Film growth is limited to the boundary and sub- 
sequently self-limits the reaction as reactants are 
blocked by the growing film. The result is a self- 
terminated, but uncontrolled, film growth with a 
thickness between 100 and 200 nm and a rough 
ridge-and-valley-like surface morphology (7-3). 
Although these membranes exhibit excellent 
permselectivity compared with any other desali- 
nation membrane, certain features of the film 
properties and its fabrication procedure are in- 
herently limiting. The intrinsic roughness of these 
films have long been attributed to a high fouling 
propensity for reverse osmosis and nanofiltration 
processes (4, 5). Additionally, the thickness of the 
membrane, which is inversely proportional to its 
permeance, is relatively uncontrolled because the 
process simply self-terminates as the film forms. 
Last, the properties of the support layer surface— 
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including pore size, pore spacing, surface poros- 
ity, roughness, and surface chemistry—affect 
the interface between the two phases and thus 
the membrane performance in unpredictable 
ways (6-8). 

A better polyamide desalination membrane 
should have the same permselective properties 
as those of existing membranes but also be tun- 
able in each of these other properties. The thick- 
ness should be reduced to maximize permeance 
while still ensuring that the films are sufficiently 
robust so as to withstand necessary hydraulic 
pressures. The roughness should be minimized 
to lessen the likelihood that the membrane will 
foul and also improve cleaning efficiency. Last, 
the film properties should be decoupled from 
the substrate properties, allowing these selec- 
tive films to be deployed on any type of substrate. 

To better control thickness and roughness, 
Gu et al. used a molecular layer-by-layer ap- 
proach to build polyamide layers onto ultrafil- 
tration (UF) membranes. Using a polyelectrolyte 
layer to prime the surface of a porous substrate, a 
polyamide layer could be formed by molecular 
layer through a sequential interfacial polymeri- 
zation method (9). Karan et al. used a sacrificial 
nanostrand layer as a support to form free- 
standing polyamide films with varying thickness 
and roughness for organic solvent nanofiltration 
applications with no demonstration of desalina- 
tion performance (0). These methods and others 
(11-16) are complex and are unlikely to scale easily 
for commercial production. 

Electrospray can be used to deposit monomers 
as nanoscale droplets that form polyamide onto 
a substrate. During electrospraying, liquid leaves 
a needle in the presence of a strong electric field. 
Coulombic repulsion forces the ejected droplets 
to disburse with diameters well below 1 um (Fig. 1, 
A and B) (17). This characteristic drew Fenn et al. 
to use the technique for mass spectrometry of 
large polar biomolecules (8, 19). Others followed 
by using the technique to make thin films (20-23), 
nanoparticles (24), or patterns (25-27). For our 
approach, we deposit individual monomers onto 
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a substrate, where they can subsequently polym- 
erize on the surface. 

The approach is illustrated in Fig. 1, A and B. 
The drum is grounded and connected to the two 
needles by means of a high-voltage dc power 
source that can generate up to 30 kV. The dis- 
tance between the needle tip and drum is kept 
at 2 to 3 cm. Each needle extrudes one of the 
monomers in solution. MPD (in water) and TMC 
(in hexane) were kept at a molar ratio of 4:1 over 
a wide range of concentrations (table S1). A lipo- 
philic ionic liquid was added to the organic phase 
in order to increase the electrical conductivity 
(fig. S2A). A variety of UF membrane substrates 
with different pore sizes (fig. S2B), pure water 
permeance (fig. S2C), and hydrophilicity were 
studied (fig. S2D and table S2). In each case, the 
substrate was first attached to the rotating drum 
(Fig. 1A). As monomer solutions emerged from 
the needle tips, they sprayed and deposited onto 
the collector surface and reacted upon contact 
with each other. To ensure coverage over the en- 
tire substrate, the needle stage traverses along the 
collector surface (Fig. 1B). A single pass over the 
collector surface is referred to as a “single scan.” 

Films were printed on aluminum (Al) foil in 
order to demonstrate the ability to characterize 
polyamide films to find properties such as cross- 
link density, thickness, and mechanical proper- 
ties. After printing, the films are transferred from 
the foil (fig. S3A) to any substrate or kept as a 
free-standing film (Fig. 1C). Having thicker films 
that can be manipulated by hand allows for easier 
characterization of film properties. This type of 
manipulation is difficult with conventional poly- 
amide films because of their thinness, fragility, 
and integration into the supporting structure of 
typical TFC membranes. For example, determi- 
nation of cross-link density of the polyamide film 
is typically done with x-ray photoelectron spec- 
troscopy (XPS) (28). However, this method can 
be inaccurate because of surface roughness, insuf- 
ficient sample size, and compositional heterogene- 
ity with depth. Instead, manipulating a 1-um-thick 
polyamide into a thicker, crumpled form (fig. S4) 
allows us to use energy-dispersive x-ray spec- 
troscopy, which penetrates far deeper into the 
sample and thus provides a better measurement 
of bulk polyamide composition. The cross-link 
density is found to be 83%, which is reasonable 
for a film made from MPD and TMC monomers 
(fig. S4) (10). 

Films are also printed at various MPD and 
TMC concentrations (table S1) onto Al foil and 
then transferred to a silicon wafer for thickness 
measurement by using atomic force microscopy 
(AFM) (fig. S3B). Cross sections at the film edges 
(fig. S5) reveal the film profile with respect to the 
underlying planar substrate. Lower monomer con- 
centrations not only result in a thinner polyamide 
film but also greater control of film thickness per 
scan. Polyamide films as thin as 20 nm were made 
based on five scans, indicating a mean thickness 
of just 4 nm per scan (Fig. 1D). Control of thickness 
per scan was notably consistent; linearity in film 
growth with an increasing number of scans is 
depicted in Fig. 1E. 
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Films of the same composition were also 
printed onto porous polymeric substrates in 
order to evaluate their thickness, surface morphol- 
ogy, roughness, desalination performance, and 
substrate independence. Cross-sectional trans- 
mission electron microscopy (TEM) images are 
shown in Fig. 1, F to I, and fig. S6. The polyamide 
layers printed on the three UF membrane sub- 
strates exhibit similar thicknesses (Fig. 1, F to H) 
as those printed on Al foils (Fig. 1D). We note 
repeatability in thickness from Fig. 11, where five 
layers of polyamide film measuring 15 + 3 nm 
each are visible. This thickness per scan corre- 
sponds well to thickness per scan data captured 
on Al foil in Fig. 1D by means of AFM. We also 
confirm linearity in thickness with TEM images 
shown in fig. S6. 

We examined the surface morphology of the 
polyamide films formed on polymeric substrates 
using scanning electron microscopy (SEM) (Fig. 2A 
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and figs. S7 and S8). Compared with the typical 
ridge-and-valley-like morphology of conventional 
polyamide films, such as the industry-standard 
Dow SW30XLE RO membrane (Fig. 2A), signif- 
icantly smoother polyamide films are formed 
on all substrates at all monomer concentrations. 
These results are quantified by means of AFM 
analysis as shown in Fig. 2B. The root mean 
square (RMS) roughness increases with increas- 
ing monomer concentration (Fig. 2C) and the 
number of scans (Fig. 2D). For each monomer 
concentration, film roughness is similar among 
all of the substrates evaluated (fig. S9 and tables 
$3 and S4). The maximum roughness (40 + 4 nm) 
is observed for the highest MPD:TMC concentra- 
tion, 0.5:0.3 (Fig. 2C), when formed on the PAN450 
UF substrate. However, even these roughest 
films exhibit less than half of the roughness of the 
Dow SW30XLE membrane (Fig. 2C, dotted orange 
overlay). The lowest concentrations of monomers 
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tested here yield films with roughness values of 
less than 2 nm and are indistinguishable from 
the substrate’s roughness. 

The desalination performance for all mem- 
branes tested are presented in Fig. 3A, where 
higher salt rejection and water permeance are 
desired. Using the SW30XLE as a control and 
for benchmarking purposes, six of our mem- 
branes had both higher rejection and water 
permeance (within the Fig. 3A gray rectangular 
overlay), and 30 are higher in one metric or the 
other. Although it was not the intent of this 
work to outperform an industry-standard mem- 
brane in conventional metrics of water permeance 
and salt rejection, these membranes can have 
tailorable thickness and substantially lower rough- 
ness while exhibiting comparable (or better, in 
some cases) performance. 

Water permeance (Fig. 3B) and salt rejection 
(Fig. 3C) are shown to have a strong dependency 


Number of scans 


PS20 TFC 


PAN450 TFC 


Fig. 1. Details of the electrospray process for printing substrate- 
independent polyamide films with thickness control. (A) A side view 

of a schematic of the electrospray process. (B) The top view schematic 
shows the needles and a stage assembly that can move “horizontally” for 
uniform coatings on a rotated drum. A single sweep across the substrate is 
denoted as a single scan. (C) A free-standing polyamide film measuring 1.1 um 
thick in air, along with the cross-section from SEM. (D) Polyamide thickness 
as a function of MPD and TMC loading, including the corresponding thickness 


Chowdhury et al., Science 361, 682-686 (2018) 17 August 2018 


per scan. (E) Polyamide thickness as a function of the number of scans at 

a MPD:TMC concentration ratio of 0.125:0.075. For characterization data 
presented in (C) to (E), the polyamide was prepared on an Al foil substrate 
and then separated according to fig. S3A. (F to I) Cross-section TEM of 

(F) PANSO, (G) PS20, and [(H) and (I)] PAN450 TFC membranes made 
with five scans and a MPD:TMC concentration ratio of 0.5:0.3. The displayed 
thickness and error represents 20 measurements from the images, 

except for (H), where only the thinnest region is measured. 


2 of 4 


810z ‘91 IsnBny uo /6i0 beweoualos’s0ue!0s//:dy1y Wo. pepeojuMOGg 


RESEARCH | REPORT 


--A-- 0.125 : : 0.075 
--4- 0.0625 : 0.0375 


MPD:TMC MPD:TMC 
A 0.0625 : 0.0375 ) 
PS20 TFC @ 5 scans | 
B | MPD:TMC JN M0.0625 : 0.0375 5 0.083 : 0.05 _ i 0.125 : 0.075 | | 
1 um 4 a's, 
200 we 
n) 
150 . u 
PS20: ©" He... ~~... - ee en 
Substrate , 100 
50 : 
se 
O0nm + 
Pe 
Pat 
No. of scans 1 y 3 
c 100 
—a— PAN50 TFC 
PN ee RR Ee Nee —@®— PAN450 TFC __ 
—4— PS20 TFC 
--DOW SW30XLE 


Roughness (nm) 


0.0625 
0.0375 


0.125 
0.075 


Fig. 2. Dependency of surface morphology and roughness on 
printing conditions. (A) SEM image of TFC membranes at 100,000~x 
magnification for different concentrations of MPD and TMC. The 
underlying substrate and a Dow SW30XLE membrane are shown 

as controls. (B) A series of 3- by 3-um AFM topography images reveal 
increased surface roughness with the MPD:TMC concentration ratio, 
either consistently with five scans (top) or due to successive scans for 
the specific MPD:TMC concentration ratio of 0.5: 0.3 (bottom). The first 
column displays the substrate only, without any polyamide film for 


on monomer concentration. Higher concentra- 
tions of monomers form thicker (Fig. 3A) and 
less permeable films (Fig. 3B) while improving 
salt rejection (Fig. 3C). The efficacy of the TMC 
membranes can also be considered by redefining 
such data in terms of permselectivity, provided 
in fig. S10, where again these membranes simi- 
larly outperform conventional membranes. 
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The substrate selection has a noticeable effect 
on permeance. This is attributed to pore size and 
spacing on the substrate. The most permeable 
substrate (fig. S2C, PAN 450) exhibits the largest 
pores that are also closest together. This means 
that water diffusing through the film has less 
distance to travel to desorb through an open 
pore into the porous support, resulting in higher 
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2 


---- DOW SW30XLE 


3 4 5 6 7 8 9 
Number of scans 


comparison. The inset numbers indicate either the concentration ratio 
or the number of scans. (C) Graph showing RMS surface roughness 

of the TFC membranes by using three different UF membranes as 
substrates for a series of MPD:TMC concentration ratios. The first 

points in the graph represent the roughness of the substrate only. 

(D) The surface roughness increases with the number of scans for 

three different MPD:TMC concentration ratios for PS20 TFC membranes. 
The commercial Dow SW30XLE TFC RO membrane is shown as a 
dotted line in (C) and (D) for benchmarking. 


permeance (6, 29, 30). These higher permeance 
values enabled our best performing membranes 
to match the upper-bound limit of the selectivity- 
permeability tradeoff relationship as described 
in (74) (fig. S11). Furthermore, there was no sub- 
strate effect on rejection as expected (Fig. 3C) 
because rejection is primarily a function of the 
selective film chemistry and structure. These film 
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Fig. 3. Desalination performance of printed 
polyamide membranes. (A) NaC! salt 
rejection and pure water permeance for all of 
the investigated membranes. (B and C) Com- 
parison of pure water permeance and NaCl salt 
rejection, respectively, between UF substrates 
for TFC membranes made with five scans 
over ~1 order of magnitude increase in MPD 
and TMC loading. The commercial Dow 
SW30XLE TFC RO membrane is shown as a 
dotted line in (B) and (C) and as an orange 
star point in (A) for benchmarking. 


features are indistinguishable when deposited 
onto the three substrates. 

Further tuning of desalination performance is 
done by changing the number of scans and hence 
polyamide thickness (fig. S12). Some of the thinnest 
membranes exhibited very high permeance, al- 
though the highest of these has correspondingly 
low salt rejection (~10%). The TFC membranes 
made with five scans and an MPD:TMC ratio of 
0.083:0.05 on the PAN 450 UF membrane exhibited 
areasonable salt rejection of 94%, with a permeance 
of ~14.7 liter m~ hour! bar! (LMH bar”?). This 
membrane also exhibited an RMS roughness only 
2.3 nm higher than the substrate RMS roughness 
of 11.7 nm. This is less than one-sixth that of the 
SW30XLE membrane. Rejections as high as 95% 
were achieved on the same substrate for a MPD: 
TMC ratio of 0.125:0.075, with a RMS roughness 
only ~4.3 nm greater than that of the substrate 
and a water permeance of 3.68 LMH bar‘. In- 
creasing the number of scans to 10 yielded a salt 
rejection of 97.5% while still maintaining a water 
permeance of 2.87 LMH bar and a RMS rough- 
ness of less than 20 nm. 

This additive approach to making TFC mem- 
branes has resulted in membranes with tunable 
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thickness and roughness while still retaining the 
selectivity expected of reverse osmosis membranes. 
These membranes have an intrinsic smoothness 
not seen in other TFC membranes today, can be 
tailored to thicknesses as low as 15 nm with as 
little as 4-nm resolution in thickness control, and 
can be formed on substrates without prepara- 
tion. Furthermore, by decoupling the polyamide 
formation from the substrate properties, we have 
enabled the formation of TFCs on unconvention- 
al substrates and allowed for film characteriza- 
tion that would be impossible with polyamide 
films formed through conventional interfacial 
polymerization. The adaptation of this approach 
to other monomers or even simple polymers 
dissolved in solvents might enable the develop- 
ment of other TFC membranes for use in other 
separations. 
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PHOTOLUMINESCENCE 


Origin of the bright 


photoluminescence of few-atom silver 
clusters confined in LTA zeolites 


Didier Grandjean‘, Eduardo Coutifio-Gonzalez”, Ngo Tuan Cuong®*, Eduard Fron, 
Wouter Baekelant”, Saleh Aghakhani’, Philomena Schlexer’, Francesco D’Acapito®, 
Dipanjan Banerjee’, Maarten B. J. Roeffaers®**, Minh Tho Nguyen‘, 


Johan Hofkens”, Peter Lievens’ 


Silver (Ag) clusters confined in matrices possess remarkable luminescence properties, but 
little is known about their structural and electronic properties. We characterized the bright 
green luminescence of Ag clusters confined in partially exchanged Ag—Linde Type A (LTA) 
zeolites by means of a combination of x-ray excited optical luminescence-extended x-ray 
absorption fine structure, time-dependent-density functional theory calculations, and 
time-resolved spectroscopy. A mixture of tetrahedral Ag,(H20),2* (x = 2 and x = 4) clusters 
occupies the center of a fraction of the sodalite cages. Their optical properties originate 
from a confined two-electron superatom quantum system with hybridized Ag and 

water O orbitals delocalized over the cluster. Upon excitation, one electron of the s-type 
highest occupied molecular orbital is promoted to the p-type lowest unoccupied molecular 
orbitals and relaxes through enhanced intersystem crossing into long-lived triplet states. 


ew-atom luminescent silver clusters (AgCLs) 

(D stabilized through organic (such as pep- 

tides, proteins, polymers, and DNA) (2-6) or 

inorganic (such as glasses and zeolites) (7-9) 

templates have emerged as promising candi- 
dates for a broad range of applications in lighting, 
imaging, sensing, and therapeutics (4). Compared 
with conventional quantum dots, few-atom AgCLs 
combine an ultrasmall size with excellent size- 
dependent photoluminescence (PL) spanning the 
ultraviolet to near-infrared spectrum. Strong 
quantum confinement of Ag valence electrons ap- 
pears to break up the continuous density of states 
into discrete energy levels and confer molecular- 
like properties to the AgCLs. Nevertheless, the 
lack of a detailed understanding of the funda- 
mental photophysical mechanisms underlying 
their emissions is hampering the rational design 
of AgCLs with improved and tailored optical 
properties. Atomic structures for AgCLs have not 
been determined unambiguously because of their 
vast distribution of size, environment, and tem- 


‘Laboratory of Solid State Physics and Magnetism, KU 
Leuven, Celestijneniaan 200D, B-3001 Leuven, Belgium. 
?Molecular Visualization and Photonics, KU Leuven, 
Celestijnenlaan 200F, B-3001 Leuven, Belgium. *Faculty of 
Chemistry, Hanoi National University of Education, 136 Xuan 
Thuy, Hanoi, Vietnam. “Department of Chemistry, KU Leuven, 
Celestijneniaan 200F, B-3001 Leuven, Belgium. ~Dipartimento 
di Scienza dei Materiali, Universita di Milano-Bicocca, via 
Cozzi, 55, 20125 Milano, Italy. °Consiglio Nazionale delle 
Ricerche-Istituto Officina dei Materiali-Operative Group in 
Grenoble (CNR-IOM-OGG), European Synchrotron Radiation 
Facility (ESRF) LISA Collaborating Research Group, Grenoble, 
France. ‘Dutch-Belgian beamline, ESRF-The European 
Synchrotron, CS 40220, 38043 Grenoble, France. 8Centre for 
Surface Chemistry and Catalysis, KU Leuven, Celestijnenlaan 
200F, B-3001 Leuven, Belgium. 

*Corresponding author. Email: didier.grandjean@kuleuven.be 
(D.G.); maarten.roeffaers@kuleuven.be (M.B.J.R.); peter. 
lievens@kuleuven.be (P.L.) 


Grandjean et al., Science 361, 686-690 (2018) 


plate interactions, as well as the presence of a 
large fraction of nonluminescent Ag species. 
The AgCLs that self-assemble in the cavities of 
the rigid aluminosilicate crystalline framework 
of zeolites have the most homogeneous and ef- 
ficient emissions. The PL of AgCLs confined in 
thermally activated Ag-loaded zeolite structures 
of faujasite (FAU) and Linde Type A (LTA) topol- 
ogies features tunable absorption and emission, 
large Stokes shifts, and exceptionally high exter- 
nal luminescence quantum efficiencies reaching 
unity (8, 10). However, the structure of these AgCLs 
has not been fully elucidated yet because of the 
complexity of Ag-zeolite host-guest interactions 
and the sensitivity of Ag-zeolite composites toward 
radiation (electrons and photons) used in struc- 
tural characterization techniques (7). We present 
a detailed investigation of the structural and 
electronic properties of partially Ag-exchanged 
Ag3Ko-LTA zeolite by use of three complemen- 
tary techniques. This system was selected for its 
green PL featuring an excellent external lumines- 
cence quantum yield of 23% among the Ag-LTA 
zeolites, its good stability toward x-ray irradiation 
(11), and its simpler crystallographic structure 
than that of FAU. With x-ray excited optical 
luminescence (XEOL), the x-ray absorption fine 
structure (EXAFS) signal is detected exclusively 
from the Ag atom fraction involved in the PL 
process at the Ag K-edge, thus selectively deter- 
mining the structure of the emitting Ag species 
(12). This univocal assignment could not be made 
in earlier work (10); hence, approach may provide 
more detailed understanding of luminescent prop- 
erties for a variety of few-atom Ag clusters. The 
structures obtained experimentally were confirmed 
computationally with geometry optimizations by 
using density functional theory (DFT) methods, 
while time-dependent DFT (TD-DFT) was applied 
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to determine the electronic transitions responsi- 
ble for the absorption and emission spectra of 
the stable isomers. Last, to confirm the electronic 
structure of the theoretically modeled AgCLs, we 
identified the relevant decay modes and time 
scales involved in the absorption and luminescence 
processes of Ag3Ko-LTA using a combination of 
femto- to millisecond time-resolved spectroscopies. 

The large number of structural characteriza- 
tions of AgCLs stabilized in LTA zeolites, often 
performed by means of x-ray diffraction and elec- 
tron spin resonance (13-16), have led to numerous 
incomplete and often contradictory structural 
models. Ag3.4 clusters were tentatively related 
to partially Ag-exchanged LTA zeolites, whereas 
Agg Clusters were associated to fully exchanged 
samples. High-resolution transmission electron 
microscopy (HRTEM) revealed octahedral Ags 
clusters in the sodalite cages of fully exchanged 
Ag-LTA zeolites (17), but no evidence linking this 
structure to the PL was given. Similar analysis of 
partially exchanged Ag-LTA zeolites remained un- 
successful because of the strong influence of 
electron-beam irradiation (17, 18). 

EXAFS (19) provides information on cluster 
size and atom bonding both for the emissive and 
nonemissive clusters (20). By contrast, XEOL ex- 
clusively detects the XAFS signal from the atoms 
constituting the emissive species (12, 21, 22). 
Also, although x-ray irradiation can affect the 
structure of few-atom clusters (11), XEOL would 
monitor any beam degradation effect (supple- 
mentary materials). The three-dimensional (3D) 
structures of the AgCLs were determined by 
combining the fitting results of the XEOL and 
transmission-detected EXAFS collected simulta- 
neously (table S1). 

We primarily analyzed the XEOL-detected 
EXAFS of Ag3Ky-LTA. The x(k) k?-weighted 
EXAFS data and the corresponding phase- 
corrected Fourier transform (FT) best fits are 
shown in Fig. 1, A and B. 

The first and second peaks of the FT were 
fitted with, respectively, two oxygen (O) atoms at 
2.36 A (NI) and three Ag atoms at 2.82 A (N2). 
Additionally, the fit was completed with two lon- 
ger shells consisting of 0.4 K at 3.05 A (N4) and 
1.1 Ag at 3.3 A (N5). The Ag atoms coordinated 
to three other Ag atoms (Ag,) form tetrahedra- 
like Ag, clusters located inside the sodalite cage, 
as was shown with TEM (J7). The Agc atoms are 
each coordinated to two O atoms likely corre- 
sponding to extra-framework water molecules 
because of the short Ag-O distances and the fact 
that these O atoms are removed with a concom- 
itant loss of the sample PL upon dehydration of 
the material (supplementary materials). The ab- 
sence of contributions from the sodalite atoms 
[O, silicon (Si), and aluminum (Al)] embedding 
AgCLs in the EXAFS signal is discussed in the 
supplementary materials. 

The analysis of the XEOL-detected signal 
shows that the species at the origin of the bright- 
green PL observed in Ag3Ky-LTA are Ag, clusters 
with short Ag-Ag distances of 2.82 A, in which 
each Ag atom is bound to two water molecules at 
2.36 A. They are further surrounded by isolated K 
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and Ag cations likely positioned in the single six- 
membered rings (S6Rs) of the same sodalite 
cage. A twofold coordination of Age atoms, how- 
ever, does not correspond to an integer number 
of water molecules but rather to two different 
stoichiometries of x = 2 and x = 4, corresponding 
to a water coordination per Agc of 1.5 and 3, 
respectively. This analysis suggests the presence 
of a mixture of Ag,(H,O), and Ag,(H.O), with a 
~34/66 ratio (3 x 0.34 + 1.5 x 0.66 = 2). Attempts 
to use other models for analyzing the XEOL- 
detected signal, including the structure used for 
the analysis of the transmission-detected EXAFS 
presented below, were either not successful or 
led to unrealistic fitting parameters (supplemen- 
tary materials). 

We also analyzed the transmission-detected 
EXAFS collected simultaneously with the XEOL- 
detected data. The x(k) k?-weighted EXAFS data 
and the phase-corrected FT best fits of heat- 
treated Ag3;Ky-LTA are shown in Fig. 1, C and D. 
The distinct profiles compared with those col- 
lected with XEOL detection indicate that two dif- 
ferent average local environments of Ag atoms 
were measured simultaneously, which is consist- 
ent with the x-ray absorption near-edge structure 
analysis (supplementary materials). 

The first peak in the FT was fit with 2.5 O at 
2.34 A (N1) corresponding to the combination 
of the framework O (O,) from the S6Rs rings 
and the H,O ligands, as shown in the XEOL 
analysis. The second peak in the FT is a multi- 
peak composed of 2.6 Si/Al atoms (N2) at 3.26 
to 3.30 A corresponding to a fraction of non- 
luminescent Ag cations located near the center 
of the S6Rs (Aga) (fig. $13), not detected in 
XEOL-EXAFS. The second peak in the FT anal- 
ysis also contained a weaker Ag-Ag contribution 
(N3) of 1.7 Ag at unusually short distances of 
2.70 A. This feature corresponds to the remain- 
ing part of the silver atoms Agc (~57%) that are 
coordinated to ~3 (1.7/0.57) silver neighbors and 
are forming Ag, clusters inside the sodalite cage. 
The 4% discrepancy between the Agc-Age (2.70 
to 2.82 A) distances found by the two detection 
approaches suggests that XEOL measured pref- 
erentially the excited state structure of the clus- 


ters (supplementary materials). Agc in Ag, clusters 
are coordinated to 2.1 O (1.2/0.57) from water 
molecules. Additionally, four shells (N,, to Nz) 
consisting of K and Ag were detected at liquid- 
nitrogen temperatures between 2.97 and 4.49 A 
corresponding to Agc-K or Agc-Agr distances 
from basal Agc in Ag, tetrahedra associated with 
the absence or presence, respectively, of a water 
molecule sandwiched between the two atoms 
(fig. S13). These shells are complemented by 
two long-distance contributions (Ng and No) 
corresponding to Agc-Agp from apical Age in Ag, 
tetrahedra and Agp-Agp (fig. S7a) detected at 5.24 
and 6.13 A at LN. The distinct combination of 
distances of N,, to Nog shells closely fit the Ag-LTA 
sodalite crystallographic model (fig. S7B), fully 
supporting the AgCL local structures proposed. 

The EXAFS investigation shows that the emit- 
ters in Ag3Ko-LTA consist of ~40% of Ag,(H20),, 
and 60% of Ag,(H.O),. tetrahedra-like clusters 
located at the center of the sodalite cage. These 
structures are presented in Fig. 1, Eto J. The 
clusters are coordinated at their faces by two or 
four water molecules located near the center of 
the S6Rs and sandwiched between three Agc and 
one Ag (or K cation). AgCLs consisting of 57% of 
the total number of exchanged Ag atoms are 
mostly surrounded by the remaining 43% iso- 
lated Agr cations plus some additional K cations 
in the S6Rs. This indicates that Ag cations in 
partially exchanged Ag;Ky-LTA concentrate (six 
to seven Ag atoms instead of three expected from 
the Ag stoichiometry) in a limited fraction of the 
sodalite cages (~45%). (23). 

We used a combination of DFT and TD-DFT to 
model AgCLs and probed their charges with the 
natural bonding orbital approach. Two stable 
isomers—[Ag,(H,0),(Sig4H24,036)], @ = 2 and @ = 4, 
showing the best agreement between calculated 
and measured structures and absorption spectra— 
were obtained when applying a +2 charge pref- 
erentially localized on the Ag,CLs but extending 
toward the cluster surrounding (tables S4 and 
85). The doubly charged Ag,CLs exhibit a closed- 
shell electronic configuration, in which the Ag 4d 
shell is completely filled, and the two remaining 
valence 5s electrons are delocalized over the 


Fig. 1. Ag K-edge XEOL and A 
transmission-detected 
EXAFS and FTs of heat- 
treated Ag3Ky9-LTA and 
derived structures. 
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SiO, or AIO, Q GH, 
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cluster. Within a superatom model, two electrons 
associated to a metal cluster correspond to the 
smallest magic number with enhanced stability 
(24, 25), which is consistent with analogous the- 
oretical work on Ag,CLs (26, 27). 

Both isomers [Ag,(H20),, v = 2 and 4] consist 
of pseudotetrahedral Ag, clusters located, unlike 
the experimental structures, in an off-centered 
position in the sodalite cage, with one or two Ag 
atoms coordinated directly to Oy (supplementary 
materials). In Ag,(H.O),, each Ag atom is coordi- 
nated on average to two O atoms (water plus Of) 
with an average Ag-Ag bond distance of ~2.79 
and 2.92 A in the ground and excited state, 
respectively (fig. S14). In Ag,(H2O)»s, an average 
Ag-Ag distance of 2.87 A in the ground state and 
a mean O coordination close to 1.3 were obtained 
(fig. S15). These calculated structures featuring a 
5% increase in the Agc-Agc distances from the 
ground to the excited states are in good agree- 
ment with the experimental results (supplemen- 
tary materials). 

The calculated frontier orbitals for both 
Ag,(H.0),”* and Ag,(H.0),.* isomers (Fig. 2A 
and fig. S16, respectively) are composed of a con- 
tribution of ~50% from Ag 5s atomic orbitals 
and of up to 25% from the O states of the sur- 
rounding O;, and H,O, as shown in the density 
of states curves (fig. S18). A doubly occupied 
HOMO of totally symmetric s-type orbital forms 
the ground state (‘S,) and two sets of three sin- 
glet ('P) and three triplet (?P) LUMOs consisting 
of one-node p-type orbitals form the expected 
lowest-lying cluster orbitals for a cluster system 
with two skeleton electrons. Both states in each 
isomer have similar atomic orbital compositions 
(fig. S18), suggesting that the excitations are 
mainly localized on the cluster. 

The key role of water ligands in AgCLs’ elec- 
tronic properties is highlighted in the energy 
level diagram (Fig. 2B and fig. $17) of Agy(H»0),"* 
and Ag,(H.0O),”* along with water-free Ag,°* clus- 
ters as reference. In water-free unperturbed Ag,”", 
the ground state is the HOMO 'So, and the LUMOs 
consist of threefold degenerated singlet 'P and 
triplet ?P excited states. Upon coordination with 
water, the ligand field lifts the degeneracy of the 
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LUMOs into six excited states: three singlet 'P 
[total spin quantum number (S) = 0; total orbital 
angular momentum quantum number (ZL) = 1; 
magnetic quantum number (77) = -1, +1, or 0) 
and three triplet ?P (S = 1; L = 1; m) = -1, +1, or 0) 
states. The absorption occurs from the ground 
state corresponding to two electrons of oppo- 
site spins on a HOMO s-like orbital (’Sp) to 
the singlet excited states 'P (3.5 and 3.7 eV) that 
correspond to one electron on a s-like orbital and 
one electron on LUMO, LUMO+1, or LUMO+2 p- 
like orbitals, with the two electrons having op- 
posite spins. 

These transitions feature large oscillator 
strengths f because they are allowed by spin 
and angular momentum selection rules. The strong 
overlap of the high-energy triplet ?P (S = 1, L = 1, 
my = 0) state with the 'P singlet states ensures, in 
combination with large spin-orbit coupling ex- 
pected for Ag, an enhanced intersystem crossing. 
Upon light excitation, a fraction of 'P singlet 
states population is transferred to the high- 
energy triplet state that finally decays into the 
low-lying *P (S = 1, L = 1, m = -1 or +1) triplet 
state. As shown in Fig. 2B, the intensity of the 
ligand-field splitting of the p-like LUMOs is 
directly proportional to the number of water 
molecules coordinating AgCLs, decreasing the 
band gap when going from Ag,(H2O).”* to Agy 
(H,0),?* isomers. The bright-green emission of 
Ag,(H,O),7* then occurs from the lowest-lying °P 
triplet excited state 2P (S = 1, L = 1, m, = -1) to the 
ground state 'So. 

The modeled absorption spectra for the 
Ag,(H»0),"* and Ag,(H.0)» ** isomers (Fig. 3) are 
in excellent agreement with the steady-state opti- 
cal experimental data. Calculated absorption 
peaks found at 343 and 320 nm in Ag,(H,0),7* 
and Agy(H»0).”", respectively, closely match the 
experimental excitation peaks at 340 nm (3.64 eV) 
and 310 nm (4.00 eV). These results also confirm 
the assumption that the 37/63 intensity ratio of the 
two main emission peaks is directly related to the 
34/66 fraction ratio of Ag,(HO), and Ag,(H,O), 
present in Ag3Ky-LTA composites. Last, the green 
PL energy obtained experimentally at 550 nm 
(2.25 eV) fits the transition energy of 556 nm 
(2.23 eV) of the modeled transition from the 
relaxed lowest-lying °P triplet excited state to the 
‘So ground state (Fig. 2B). 

To determine the decay time scale and ener- 
gies of Ag3;Ko-LTA modeled optical transitions 
and to verify the triplet nature of its bright green 
PL, we performed a global analysis of the decays 
obtained with femtosecond fluorescence up- 
conversion, time-correlated single-photon count- 
ing (TCSPC) and nanosecond luminescence 
time-resolved spectroscopies. Decays obtained 
with femtosecond fluorescence up-conversion 
in the range of 410 to 570 nm reveal three time 
components of 0.5 and 2.6 ps related to relaxation 
processes and >50 ps attributed to the 'P-to-'S, 
transition (Fig. 4, A and B; fig. S24, decay traces; 
and table S7). The amplitude-to-wavelength de- 
pendence (AWD) of the first two ultrafast com- 
ponents (maxima centered at 510 and 530 nm) 
are slightly blue-shifted relative to the maximum 
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Fig. 2. Frontier orbitals of [Ag4(H20)4(Siz4H2403¢)]*" and energy level diagram of Aga(H20)27* 
and Aga(H20),2* clusters in Ag3Ko-LTA. (A) Frontier orbitals consist of one single symmetric s-type 
HOMO (‘So) and three singlet one-node p-type /P (m, = -1, +1, or 0) LUMOs (p,, Py, Pz) delocalized over 
all the Ag and O atoms of the cluster. Atom colors are Si, gray; O, red; Ag, blue; hydrogen, white. 

(B) Energy level diagram showing the ground-state ‘So and the excited states °P and !P of water-free 
unperturbed Ag,°* clusters and the ground-state !So and the six singlet !P and triplet °P excited states of 
Aga(H20)2** and Aga(H20)4** perturbed by means of water ligand field interaction. Blue arrows 
represent the allowed transitions, and the green arrows represent the luminescent transitions between 


the relaxed states. 


of the steady-state emission spectrum (550 nm), 
suggesting the rapid depopulation of 'P (S = 0; 
L =1; my = -1, +1) singlet excited states. These 
ultrafast components attributed to nonfluores- 
cent short-lived intermediate species formed after 
the relaxation of 'P Franck-Condon states match 
closely the energy of the 1p (S =0,L=1,m=-))- 
to-'S, transition predicted with TD-DFT (Fig. 2B). 
These intermediate states rapidly convert into 
fluorescent *P triplet states lying at similar ener- 
gies via intersystem crossing (Fig. 4, B and D). 

This model is confirmed through the analysis 
of the PL decays in the micro- to millisecond 
range that reveal three time constants of 423 ns, 
10.6 us, and 116 us. On the basis of the AWD 
presented in Fig. 4C, the 423-ns component is 
attributed to an excited state with an intense emis- 
sion peaking at 520 to 540 nm (2.38 to 2.30 eV), 
with estimated radiative and nonradiative rates 
of 5.43 x 10° sand 18.2 x 10° s"}, respectively. 
The close resemblance of the AWD of this state 
with the stationary emission spectrum indicates 
that these long-lived species are at the origin of the 
bright-green emission observed in Ag3Ko-LTA. This 
emission occurring from long-lived states charac- 
teristic for spin-forbidden transitions points 
toward their peculiar triplet nature. The other 
decay times of 10.6 and 118 1s are attributed to 
two different triplet excited states, with weak 
emissions centered at 630 and 680 nm (1.97 and 
1.82 eV), respectively (Fig. 2B; fig. S25, decay 
traces; and table S8), which are associated with 
the presence of residual amounts of emissive 
species such as AgCLs with different size and/or 
water coordination (0). 

The triplet nature of Ag3;Ko-LTA bright-green 
emission is further corroborated by the remark- 
able enhancement of the PL accompanied by a 
dramatic increase of the decay time from 423 ns 
to 106 us for the 540-nm main emission at low 
temperature (77 K), whereas the faster components 
remained mostly unaffected (figs. S26 to $33). 
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Fig. 3. Steady-state excitation-emission of 
Ag3Ko-LTA. (A) 2D excitation—emission plot. 
(Inset) The picture of an x-ray-irradiated sam- 
ple under 366 nm illumination. (B) Excitation 
spectrum Adetection = 555 nm of as-prepared 
Ag3Ko-LTA. Calculated ‘Sg HOMO to 'P (S = 0; 
L =1; m= -1, +1) LUMOs absorption spectra 
of x = 2 and x = 4 [Ag,(H20),]** isomers 
showing a good agreement with experiments. 


This result shows that excited-state kinetics, in 
which nonradiative decay channels are hindered 
at low temperature and characteristic of triplet- 
state emissions, is involved. Hence, given the close 
correspondence between the computational and 
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Fig. 4. Time-resolved spectroscopy of Ag3Kg-LTA. (A and B) AWD 3D time-resolved fluorescence 
emission spectra in 50-ps time window obtained with (©) femtosecond fluorescence up-conversion, 
in 1-ms time window through nanosecond luminescence. (D) Schematic illustration of the main 
electronic states involved as a function of energy (on the vertical axis). 


the photophysics results, the cluster-ligand inter- 
action as included in the model of a two-electron 
doubly charged AgCL, possibly with a fraction of 
the charge located on the sodalite cage, explains 
the occurrence of long-lived bright luminescent 
states in Ag3Ko-LTA. 

By measuring exclusively the local structure of 
the emissive Ag clusters in partially exchanged 
LTA zeolites, XEOL-XAFS has allowed for the 
first time the unambiguous identification of their 
functional structures. DFT modeling based on 
these detailed structures showed that the double 
positively charged Ag,(H,O), and Ag,(H.O), clus- 
ters, in which water ligand molecules modulate 
the HOMO-LUMO gap, behave as confined two- 
electron helium or alkaline earth-like superatom 
quantum systems that mainly emit via their long- 
lived lowest-lying °P triplet excited state, as con- 
firmed with time-resolved optical spectroscopy. 
We anticipate that similar photophysical proper- 
ties may also apply to luminescent AgCLs con- 


Grandjean et al., Science 361, 686-690 (2018) 


fined in other inorganic and organic scaffolds. 
This is likely the case for AgCLs confined in fully 
exchanged LTA or in FAU zeolites that possess 
very similar structural and luminescent properties 
(0). This new understanding of the mechanism 
of AgCLs’ bright luminescence and its expected 
dependence on the interaction with oxygen ligands, 
electron confinement, electrostatic interaction, 
and charge transfer to the surrounding silver 
atoms, which differ from the single Ag cations 
emission model proposed earlier (28-30), should 
lead to substantial improvements in the rational 
design of AgCLs optical properties. 
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SOLID-STATE PHYSICS 


Twistable electronics with dynamically 
rotatable heterostructures 


Rebeca Ribeiro-Palau””*{+, Changjian Zhang”**, Kenji Watanabe“, 
Takashi Taniguchi*, James Hone”, Cory R. Dean't 


In heterostructures of two-dimensional materials, electronic properties can vary 
dramatically with relative interlayer angle. This effect makes it theoretically possible 

to realize a new class of twistable electronics in which properties can be manipulated 

on demand by means of rotation. We demonstrate a device architecture in which a layered 
heterostructure can be dynamically twisted in situ. We study graphene encapsulated by 
boron nitride, where, at small rotation angles, the device characteristics are dominated by 
coupling to a long-wavelength moiré superlattice. The ability to investigate arbitrary 
rotation angle in a single device reveals features of the optical, mechanical, and electronic 
response in this system not captured in static rotation studies. Our results establish the 
capability to fabricate twistable electronic devices with dynamically tunable properties. 


he weak van der Waals forces between the 

atomic planes in two-dimensional (2D) ma- 

terials make it possible to fabricate devices 

with arbitrary rotational order. Devices could 

be designed in which electronic properties 
are controlled by varying the relative twist angle 
between layers (1). Several studies have estab- 
lished that in heterostructures assembled from 
2D crystals, electron tunneling between layers 
varies strongly with rotation (2-8). In twisted 
bilayer graphene (two monolayers in direct con- 
tact but with an angle mismatch between the 
layers), several new phenomena have been pre- 
dicted and observed, including topological val- 
ley transport (9-13) and superconductivity (14), 
as a consequence of angle-dependent interlayer 
coupling. Likewise, the formation of interlayer 
excitons in transition metal dichalcogenide het- 
erostructures is highly sensitive to angle (15-17). 

The effect of rotational alignment between con- 
ducting and insulating 2D layers can be equally 
important. An example is provided by graphene 
coupled to hexagonal boron nitride (BN). The 
closely matched lattice constants create a large 
moiré superlattice that develops near zero angle 
mismatch (/8, 19). This superlattice substantially 
alters the graphene band structure, opening an 
energy gap at the charge neutrality point (CNP) 
and creating replica Dirac points at higher en- 
ergies (20-22). 

Several techniques have been developed to 
fabricate layered heterostructures with controlled 
rotation between the layers, including optical 
alignment of crystal edges (18, 20-22), rota- 
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tional alignment (23) during assembly, and self- 
alignment through thermal annealing (24, 25). 
However, in each case, an a priori understanding 
of the crystallographic orientation of each layer 
is required before assembly; motion between 
the layers during assembly makes it difficult to 
achieve precise angle control; and, most impor- 
tantly, once assembled, the angle cannot be fur- 
ther modified. Here we present an experimental 
technique that provides on-demand control of 
the orientation between layers in a van der Waals 
heterostructure. In a BN/graphene/BN structure, 
we demonstrate in situ control over the length of 
the moiré potential and thus achieve dynamic 
tunability of the optical, mechanical, and elec- 
tronic properties of the system. 

Figure 1A shows a cartoon schematic of our 
device design. Using the mechanical assembly 
technique described in (26), graphene was placed 
on top of a large flake of BN, then etched into 
a Hall bar shape with an oxygen plasma. The 
graphene layer is intentionally misaligned to 
this BN, producing a short-wavelength moiré 
potential that does not substantially alter the 
intrinsic graphene band structure (27). Next, a 
preshaped BN structure was transferred on top 
of the graphene. Last, electrical contacts were 
patterned onto the exposed leads of the graphene 
(fig. S1). Because of the low mechanical friction 
between graphene and BN, we could freely ro- 
tate and translate this top BN layer by using an 
atomic force microscope (AFM). Pushing on one 
of the arms of the uppermost BN structure ro- 
tates this layer (Fig. 1, B to D) and changes its 
crystallographic orientation with respect to the 
graphene layer. The moiré superlattice wavelength, 
A, generated between these crystals is given by 


(1+ d)a 
s/2(1 + 8) [1 — cos(6)] + 8? 


(1) 


where 6 = 0.017 is the lattice mismatch between 
graphene and BN, a is the lattice constant of 
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graphene, and 0 is the rotational mismatch be- 
tween the layers. By rotating the top BN layer, 
the wavelength of the resulting moiré superlat- 
tice can be dynamically varied. 

Imaging with the same AFM provided a real- 
time measurement of the orientation of the rotat- 
able BN layer relative to the reference frame of 
the AFM, which we label as the absolute angle 04 
(for example, Fig. 1, B to D). More important is 
the relative angle, 0, between the rotatable BN 
and the encapsulated graphene crystal lattices. 
To determine this angle, we identified and mea- 
sured angle-dependent features in the optical 
Raman spectrum of the heterostructure. 

Figure 1F shows a series of Raman spectra 
measured for 6,4 from 34° to 64° The most strik- 
ing variation in the Raman spectra was an in- 
crease in the full width at half maximum of the 
2D peak (FWHMsgp) (19, 28); a well-defined max- 
imum occurred for this device at ~34.2° (Fig. 1G). 
On the basis of previous work (28), we interpret 
the peak position as corresponding to zero-angle 
alignment between the BN and graphene crystals. 
This peak provides a reference that we could use 
to determine 6 for any orientation of the BN layer. 

Our results highlight the robustness of Raman 
spectroscopy as a tool to characterize the rota- 
tional order in these van der Waals heterostruc- 
tures. Moreover, previous efforts to study effects 
of twist angle in this system required numerous 
samples with different fixed angles, whereas here 
we demonstrate a mapping of the angular depen- 
dence with better than 0.2° precision in a single 
tunable device. One notable disagreement with 
previous results (28) is an overall narrower 2D 
linewidth in our devices. In previous work, this 
reduced linewidth was proposed to be a conse- 
quence of a reduction of the in-plane strain in 
fully encapsulated structures, so that areas of full 
commensuration to the aligned BN disappear 
(19). However, our observation of the same linear 
trend and a reduced linewidth for the misaligned 
position, for which no commensurate regions are 
expected, suggests a different origin. We specu- 
late that this linewidth reduction was caused by 
the change in the dielectric environment of the 
graphene caused by the presence of the second 
BN, as proposed in (29). Thus, our results reopen 
the question of the existence of commensurate 
states in encapsulated graphene devices. We addi- 
tionally have identified a shift in the position of 
the 2D and G peaks with relative angle (fig. S2). 

Mechanical resistance while pushing the BN 
imparted a torque to the AFM tip, causing it to 
cant away from a vertical position and produce a 
voltage difference in the AFM’s photodetector. 
This effect can be used to identify variations in 
frictional forces (Fig. 2A). When sliding the top 
BN layer with the AFM tip in a translational mo- 
tion far from alignment, we identified three re- 
gimes (Fig. 2B): (i) sliding friction between the 
tip and the substrate before the tip encountered 
the BN, (ii) static friction when the tip encoun- 
tered the BN but resisted translation, and (iii) 
dynamic friction once the BN began to move. 

Figure 2C shows a plot of the change in the tip 
deflection under continuous rotation (dynamic 
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Fig. 1. Rotatable heterostructures. (A) Schematic cartoon of the device 
structure and the experimental technique. BN, hexagonal boron nitride. (B to 
D) AFM image of a fabricated device showing three different orientations of the 
top BN. The angle identified in each panel is the absolute angle referenced to 
the AFM coordinate system (@,). The images were acquired by the same AFM 
used to rotate the BN layer. Scale bars, 1 um. (E) Schematic illustration of the 
moiré superlattice arising between graphene (red) and BN (blue) at zero angle. 
The moiré wavelength, A, is shown. (F) Raman spectrum of the device shown in 


Fig. 2. Mechanical properties. (A) Schematic A 
description of friction measurements. 

When the AFM tip encountered the BN 
structure, it canted, causing a repositioning 

of the reflected laser spot in the four-quadrant 
photodetector. The resulting voltage 

difference was proportional to the tip cant AFM {7 
angle (referred to here as the tip deflection) tip — q 
and served as a measure of the torque force J 
acting at the end of the tip. (B) Tip deflection 
versus time in a translational push of the 
upper BN structure. Different regimes of the 
measurement were identified: (i) As the 


Graphene 


BN 
Si/SiO, 


Intensity (a.u.) 


2600 


Raman shift (cm) 


2800 


8, (degrees) 


(B), (C), and (D) for 64 between 34.2° + 0.2°and 36.0° + 0.2° The black curve shows 
an additional measurement acquired at ~64° a.u., arbitrary units. (@) FWHM of 
the 2D peak as a function of the absolute angle. All Raman measurements were 
taken with the gate bias held at V, = O V. The peak FWHM position identifies 
zero-angle crystallographic alignment. Error bars represent the precision with 
which 6 can be determined from the AFM topographic images. The dashed line 
represents the FWHM measured for all angles larger than ~2° away from perfect 
alignment, with the shaded region representing the associated uncertainty. 


Photodetector B (i) (ii) (iii) 
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| 


tip dragged along the surface, tip-substrate Cc 
friction resulted in a steady-state tip 
deflection. (ii) When the tip encountered 
the BN structure, it initially resisted 
translation, and the tip deflection increased. 
We refer to this as the static friction regime. 
(iii) Once the BN was in motion, the tip 
deflection relaxed slightly, providing a 
measure of the dynamic friction at the 
BN-graphene interface. (C) Tip deflection 
versus absolute angle, measured during 

a continuous rotation of the BN. Two 

peaks were observed, spaced 60° apart. 
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friction), in which we subtracted the background 
caused by piezoelectric drift and residual friction. 
Two prominent peaks are separated by 60° This 
result closely resembles a previous measurement 
of friction between two graphitic structures in 
which a transition from superlubricity (where 
the structures are in an incommensurate posi- 
tion and the atomic shear forces are negligible) 
to a dissipative state was observed at commensu- 
rate angles of the threefold symmetric hexagonal 
lattices (30-32). However, because of the lattice 
mismatch between graphene and BN, there is 
not true lattice commensurability at any angle, 
and the increase in the friction should have a 
different origin. Recent numerical simulations 
suggest that contributions of the moiré super- 
lattice to the frictional force cannot be neglected 
and are expected to be maximal for aligned layers 
(33), which could explain our experimental re- 
sult. Our device structure allows us to study me- 


chanical properties, such as frictional force, in 
atomically flat materials without rugosity con- 
tributions. These results also highlight the possi- 
bility of using the friction response as an in situ 
method to monitor and control layer alignment 
in heterostructures. 

Our device design enabled us to measure elec- 
tron transport in the active layer (in this case, 
graphene) while changing the relative orienta- 
tion of the overlayer. Figure 3A shows a plot of 
the four-terminal resistance of the graphene 
layer as a function of back gate voltage V, for 
different values of 6 at room temperature. Near 
6 = 0, additional satellite resistance peaks appear 
symmetrically in density around the CNP. This 
result is consistent with the emergence of satel- 
lite Dirac points induced by scattering from the 
moiré superlattice potential (78, 20-22). 

As @ increased from zero, the satellite peaks 
diminished in intensity and moved further from 
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the CNP to higher gate values. To analyze this 
behavior more quantitatively, Fig. 3B plots the 
satellite peak position in V, versus @ deter- 
mined from the AFM imaging. The measured 
position shows excellent agreement with the 
values of carrier density n at which the full 
filling of the miniband occurs, n = 8/ V 302 7 
where the carrier density and gate voltage are 
related by n = C,(V, — Venp), Cz is the capacitive 
coupling of the electrostatic gate, and Ais given 
by Eq. 1 (18, 20-22). The error bars in Fig. 3B, 
~+0.2°, reflect the precision with which 6 can 
be determined from the AFM topographic images. 
Determining 0 from the V, position of the satel- 
lite peak at low temperature provides a more ac- 
curate measurement with uncertainty less than 
+0.1° However, this method of determining the 
angle is limited to only a few degrees where the 
satellite peak remains within an accessible den- 
sity range. 
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Fig. 3. Electronic transport properties. (A) Four-terminal resistance 
(Rap) as a function of V, for different alignments of the graphene/BN 
structure, acquired at room temperature. (B) Position of the satellite peak 
in gate voltage as a function of the relative angle. The +0.2° error in the 
angle reports the precision achieved with the AFM imaging in tapping 
mode. The vertical error bars represent the maximum uncertainty with 
which we can determine the position of the satellite peak owing to the 
broadening of the peak. (C) Linear dependence of the maximum value of 
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the four-terminal resistance at the satellite peak at 300 K (red) and 1.7 K 
(blue) as a function of the moiré length. (D) Energy gap, measured by 
thermal activation, for the satellite peak (circles) and the charge neutrality 
point (diamonds) as a function of the relative angle. Open symbols 
represent a repeated measurement at a given angle after moving through 
other angles and thermally cycling. (E) Schematic band structure for 
native graphene and (F) for a graphene-BN heterostructure with a 

small twist angle. E, energy; k, momentum. 
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Plotting the resistance of the satellite peak 
versus A for a single device revealed an appar- 
ently linear variation in the magnitude of the 
satellite peak resistance (Fig. 3C). The resistance 
of the CNP did not change linearly with 0 in this 
range (fig. S3). The origin of this linear depen- 
dence is not known. This striking observation 
highlights how a physical phenomenon previ- 
ously obscured (18, 20-22) by sample-to-sample 
variations became clear by varying rotation in a 
single sample. 

We measured the energy gap versus angle at 
both the central and satellite Dirac points by 
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Fig. 4. Compilation of observations with the 
angle control technique. (A) Four-terminal 
resistance as a function of the relative angle 
measured at a carrier density of -1.9 x 10 cmt. 
(B) Tip deflection in friction measured simulta- 
neously with the electronic transport. (C) FWHM 
of the 2D peak of the Raman spectrum as a 
function of the relative angle. All measurements 
were performed in the same device. 
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thermal activation. The gap magnitudes (Fig. 3D) 
were in good agreement with electronic transport 
measurements in encapsulated (34) and nonen- 
capsulated devices (19, 20), optical measurements 
made in epitaxial BN/graphene structures (35), 
and theoretical calculations (36). The energy gap 
of the satellite peak decreased smoothly away 
from 0 = 0. In contrast, the energy gap at the 
CNP displayed a more complex behavior near 
6 = O and only decreased appreciably for |6| > 2° 

The origin of the extra structure observed for 
the CNP remains to be understood. However, 
this result does not appear to be simple experi- 
mental noise because several gap values were 
confirmed to be reproducible when measured 
nonconsecutively (i.e., after thermally cycling 
and rotating through different angles and back). 
The difference in behavior of the two energy gaps, 
which would be impossible to observe in a study 
that used multiple samples at fixed angles (34), 
reflects their different physical origins and high- 
lights the importance of these results in fully 
understanding the band structure modifications 
resulting from variations in angular alignment 
(Fig. 3, E and F). The persistence of an energy gap at 
the CNP in encapsulated devices for angles beyond 
that at which a commensurate-incommensurate 
transition was previously identified (19) suggests 
that this energy gap is not related solely to the 
presence of a commensurate state. 

Figure 4 shows a direct comparison of the 
optical, mechanical, and electronic response 
versus angle measured in the same device. The 
four-probe resistance (Fig. 4A), the maximum 
tip deflection signal (friction; Fig. 4B), and the 
maximum FWHM of the Raman 2D peak (Fig. 
4C) coincide exactly, confirming the relation be- 
tween these properties. The four-probe resist- 
ance (Fig. 4A) and friction response (Fig. 4B) 
were acquired simultaneously at a fixed carrier 
density of 1.9 x 10’? cem™ (corresponding to the 
dashed line in fig. S4) while continuously rotat- 
ing the BN layer. At this relatively high carrier 
density, the bulk resistance was modulated by 
more than an order of magnitude over <2° of 
rotation (at cryogenic temperatures, this modu- 
lation increases to more than two orders of mag- 
nitude; supplementary materials). 

Our demonstration of rotatable heterostructures 
with dynamically tunable device characteristics 
presents an opportunity in device engineering. Al- 
though we investigated BN-encapsulated graphene 
as a model tunable system, this technique is readily 
extendable to generic heterostructures fabricated 
from 2D materials. In addition to band structure 
tunability, emergent phases such as superconduc- 
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tivity and magnetism may be controllably varied 
with rotation. 
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Plant iron acquisition strategy 
exploited by an insect herbivore 


L. Hu’, P. Mateo”, M. Ye’, X. Zhang’, J. D. Berset’, V. Handrick®*, D. Radisch*, 
V. Grabe®, T. G. Kéllner’, J. Gershenzon*, C. A. M. Robert}, M. Erb’+ 


Insect herbivores depend on their host plants to acquire macro- and micronutrients. Here 
we asked how a specialist herbivore and damaging maize pest, the western corn rootworm, 
finds and accesses plant-derived micronutrients. We show that the root-feeding larvae use 
complexes between iron and benzoxazinoid secondary metabolites to identify maize as a host, 
to forage within the maize root system, and to increase their growth. Maize plants use 

these same benzoxazinoids for protection against generalist herbivores and, as shown here, 
for iron uptake. We identify an iron transporter that allows the corn rootworm to benefit from 
complexes between iron and benzoxazinoids. Thus, foraging for an essential plant-derived 
complex between a micronutrient and a secondary metabolite shapes the interaction between 


maize and a specialist herbivore. 


ron (Fe) can be a limiting micronutrient for 

plants and herbivores (7). Plants increase Fe 

availability by secreting reducing agents and 

Fe chelators, so-called phytosiderophores, into 

the rhizosphere (2). Benzoxazinoid secondary 
metabolites, which are produced and exuded by 
grasses such as maize, may act as phytosider- 
ophores in addition to protecting plants against 
herbivores (3, 4). The western corn rootworm 
(WCR), a worldwide maize pest, can tolerate 
and sequester benzoxazinoids (5). In addition, 
root-feeding WCR larvae prefer benzoxazinoid- 
producing maize plants and require benzoxazi- 
noids to identify crown roots as the preferred feeding 
site (6). WCR crown root damage reduces plant 
growth (6). 

Here, we investigate the potential role of 
benzoxazinoids as phytosiderophores and WCR 
foraging cues. To identify the chemical motif 
used by WCR, we evaluated WCR behavior on 
different maize benzoxazinoid mutants (Fig. 1 and 
fig. SI). WCR larvae preferred wild-type (WT) 
rather than benzoxazinoid-deficient bx and bx2 
mutant plants. The larvae also preferred WT to bx6 
mutants, which overaccumulate DIBOA-Glc at the 


Fig. 1. Bioactivity-guided genetic pathway 
fractionation reveals that feeding prefer- 
ences of WCR larvae are associated with 
7-O-methylated, N-hydroxylated benzoxazi- 
noid secondary metabolites in maize roots. 
(A to E) The benzoxazinoid biosynthesis 
pathway, including its major products (1 to 6), is 
shown. Gray boxes denote mutants and trans- 
genic plants with altered activity of the corre- 
sponding enzymes and their respective WT 
lines. Benzoxazinoid levels (+SE, n = 6 to 

10 biological replicates) of mutant and WT plants 
and WCR feeding preferences are shown (+SE, 

n = 12 to 20 choice situations with five larvae 
each). Full time courses are shown in fig. S1. FW, 
fresh weight. Asterisks indicate significant differ- 
ences (*P < 0.05; **P < 0.01; ***P < 0.001). 
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expense of 7-O-methylated and 8-O-methylated 
benzoxazinoids, suggesting that DIBOA-Glc is 
not a preferred benzoxazinoid. WCR larvae 
preferred bx13 mutant plants deficient in 8-O- 
methylated benzoxazinoids, suggesting that 8-O- 
methylated benzoxazinoids are not preferred. To 
test the influence of N-O-methylation, we over- 


expressed the DIMBOA-Glc O-methyltransferase 
ZMBxi2 (Bx120e) (fig. S1), resulting in plants with 
excess HDMBOA-Glc and reduced DIMBOA and 
DIMBOA-Glc (Fig. 1). WCR preferred WT over 
ZmBx12-overexpressing plants, suggesting that 
N-O-methylated benzoxazinoids are not preferred 
(Fig. 1). Thus, 7-O-methylated, N-hydroxylated 
benzoxazinoids such as DIMBOA and DIMBOA- 
Glc are associated with WCR feeding preference. 

We next investigated the role of benzoxazinoids 
for within-plant feeding preferences. WCR no 
longer distinguished between crown and primary 
roots of bxl, bx2, bx6, and Bx12oe plants, while 
preference was intact in bx13 plants (figs. S2 and 
83). The correlation between plant preference and 
tissue preference suggests that the same ben- 
zoxazinoids may mediate plant preference and 
within-plant foraging. Closer inspection of crown 
and primary root benzoxazinoid profiles revealed 
a positive correlation between DIMBOA accu- 
mulation in crown roots relative to primary roots 
and WCR preference for crown roots relative to 
primary roots across the different maize mutants 
(R? = 0.56; P = 0.03, fig. S2). 

Contrary to what was expected from the ex- 
periments with the benzoxazinoid mutant plants, 
purified DIMBOA and DIMBOA-Glc did not elicit 
WCR feeding preference (Fig. 2). The DIMBOA 
breakdown product MBOA, a potential volatile 
WCR attractant (7), was equally inactive (fig. S4). 
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Because N-hydroxylated benzoxazinoids such 
as DIMBOA and DIMBOA-GIc, but not N-O- 
methylated benzoxazinoids, can form stable com- 
plexes with Fe (3), we hypothesized that these 
complexes may serve as WCR foraging cues. 
Within the pH range of the maize rhizosphere 
(4.5 to 7.5) and given that DIMBOA is the main 
benzoxazinoid around maize roots (8), Fe com- 
plexes with two [referred to as Fe(III)(DIMBOA),] 
and three DIMBOA ligands [referred to as 
Fe(IID)(DIMBOA)s] are likely to accumulate at 
the root surface. Spectrophotometric measure- 
ments of synthetically prepared complexes (fig. S5A) 
confirmed the pH-dependent formation of 
Fe(III)(DIMBOA), and Fe(III)(DIMBOA), (collect- 
ively referred to as Fe-DIMBOA, fig. S5B). DIMBOA 
reacted preferentially with Fe, followed by alu- 
minum (Al) and molybdenum (Mo) (fig. S6). 
Formation of Fe-DIMBOA was observed on WT 
crown roots, but not on bz! roots (fig. $7). At esti- 
mated physiological doses of 1.26 x 10° mol cm 
(fig. S8), FedI1)(DIMBOA), and Fe(II)(DIMBOA)s, 
but not Al-DIMBOA, Fe-DIMBOA-Glc, and Fe- 
EDTA, elicited WCR feeding preference (Fig. 2D 
and fig. S9). Fe(ID(DIMBOA); increased WCR 
feeding across a range of physiologically rele- 
vant concentrations from 1.26 x 10“ to 1.26 x 10°” 
mol cm ” (fig. S10). Preference for Fe(ID)(DIMBOA), 
was independent of larval stage, previous ben- 
zoxazinoid exposure, and the presence of maize roots 
(Fig. 2E and fig. S11). Fe(I11)(DIMBOA), rescued 
recruitment of WCR larvae to bx] mutants in soil 
(Fig. 2F) and to individual bx] roots (Fig. 2G). 
Primary roots of WT plants became more attract- 
ive upon Fe(IID(DIMBOA); application and were 
as attractive as crown roots 6 hours after appli- 
cation (Fig. 2H and fig. S12). Host plant acceptance 
on bai mutant roots was fully complemented by 
Fe(III)(DIMBOA); (fig. $13). FedID)(DIMBOA); ap- 
plication also increased WCR feeding on rice and 
barley, two non-host plant species (fig. $13). 
Thus, in addition to primary metabolites (9, 10), 
Fe-DIMBOA mediates host recognition, accept- 
ance, and within-plant foraging of WCR. 

To investigate the impact of Fe-DIMBOA on 
WCR performance, we grew WT and bx mutant 
plants in nutrient solutions. In the presence of 
free Fe that can form complexes with DIMBOA, 
WCR larvae grew better on WT than 621 plants 
(Fig. 3A). No difference was observed in the pres- 
ence of Fe-EDTA or in the absence of Fe. Exogenous 
application of DIMBOA rescued larval growth on 
bx1 mutants (Fig. 3B). Fe concentrations in WCR 
mirrored larval performance (Fig. 3C). Thus, the 
interaction between free Fe and DIMBOA in- 
creases WCR Fe supply and performance. 

Fe supply is critical for plant performance. 
Leaf chlorosis, a typical sign of iron deficiency, 
was observed in bx, bx2, and bx6 mutant seed- 
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Fig. 2. Complexes between iron (Fe) and the benzoxazinoid DIMBOA mediate host recognition, 
acceptance, and within-plant foraging of WCR larvae. (A and B) Influence of pure DIMBOA and 
DIMBOA-GlIc on WCR feeding preference on benzoxazinoid-deficient mutant roots. (©) Chemical 
structure of the Fe complex Fe(III)(DIMBOA)3. (D and E) Effect of synthetic Fe(Ill)(DIMBOA)3 on WCR 
feeding preference for roots (D) and agarose cubes (E). (F) WCR feeding preference for soil-grown 
WT and bx1 mutant plants supplied with Feclll)(DIMBOA)3. (G and H) Influence of Fe(lII)(DIMBOA)s3 
complementation on within- and between-plant WCR feeding preferences. [+SE, n = 20, except (F), 
n= 10 to 11 choice situations with five larvae each]. Full time courses are shown in figs. S4, S9, and S12. 
Asterisks indicate significant differences (*P < 0.05; **P < 0.01; ***P < 0.001). 


lings growing in nutrient solution and potting soil 
supplemented with Fe salts (Fig. 3E and fig. S14). 
No chlorosis was observed in bx2 mutants supplied 
with Fe-EDTA (fig. S14). DIMBOA supplementa- 
tion rescued bx] mutants supplied with free Fe 
(fig. S14). Fe concentrations in the maize xylem 
mirrored chlorophyll patterns (Fig. 3, F and G). 
No differences in plant biomass were observed 
between genotypes (fig. S14F). Thus, the inter- 
action between free Fe and DIMBOA increases 
maize Fe supply. As maize and WCR Fe contents 
are correlated across treatments and genotypes, 
the positive effect of Fe-DIMBOA on WCR may 
be due to direct or plant-mediated effects. 

To better understand the connection between 
WCR performance and Fe availability, we manip- 
ulated the capacity of WCR to acquire Fe. We 
identified a WCR homolog of the human divalent 
metal transporter-1 (DMT1) (71), here named 
DvIRT]I (fig. S15). In Drosophila melanogaster, 
the DMT1 homolog Mv is required for Fe ho- 
meostasis and feeding decisions (12). DvIRT1 
rescued the growth of an Fe-transport-deficient 
yeast strain in the presence of free or complexed 
Fe, including Fe-DIMBOA (Fig. 4A). Silencing of 


DvIRTI in WCR (Fig. 4B) resulted in WCR Fe 
deficiency (Fig. 4C). DvIRTI silencing did not 
change WCR feeding preferences (Fig. 4D). How- 
ever, DuIRT] was required for the benzoxazinoid- 
dependent increase in WCR Fe supply and 
performance (Fig. 4E). Thus, DvIRTI enables 
WCR to acquire Fe in various forms, including 
Fe-DIMBOA. 

WCR may derive multiple benefits from Fe- 
DIMBOA. First, as Fe-DIMBOA is only produced 
by a few other plant species (13), it is a reliable 
host-recognition cue. Second, Fe-DIMBOA levels 
are highest for crown roots, which are a better 
food source for WCR than primary roots (5). 
Third, Fe-DIMBOA is accepted as a substrate 
by DvIRT1 and directly improves Fe homeostasis 
and WCR performance. Fourth, WCR larvae se- 
quester the DIMBOA breakdown product MBOA 
for self-defense against entomopathogenic nem- 
atodes (6). Fe-DIMBOA may therefore provide Fe 
as well as DIMBOA as an immune precursor. 

Grasses can use L-methionine-derived mugi- 
neic acids to chelate Fe (2). Here, we show that 
benzoxazinoids also contribute to the Fe supply 


of young maize plants. In addition, benzoxazinoids 
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Fig. 3. Interactions between free Fe and the benzoxazinoid DIMBOA 
determine growth and Fe homeostasis of maize and WCR larvae. 

(A and B) Growth of WCR larvae feeding on WT or bx1 roots supplied 
with different sources of Fe (+SE, n = 9 to 15 biological replicates) 

and pure DIMBOA (+SE, n = 15 to 20 biological replicates). (C and 

D) Corresponding Fe contents of WCR (+SE, n = 5 biological replicates, 
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Fig. 4. The Fe transporter DvIRT1 is required for Fe-DIMBOA-— 
dependent WCR performance. (A) Functional complementation of a 
yeast Fe-uptake mutant. Plasmids expressing At/RT1 (positive control, 
DvIRT1) or the empty vector pFL61 (negative control) were individually 
introduced into the Fe uptake—defective yeast DEY1453 strain, and the growth 
media were supplemented with different Fe sources. (B) Average expression 
levels of DVIRT1 in WCR larvae after feeding with double-stranded RNA of GFP 


(dsGFP, negative control) or DvVIRT1 (dsIRT1, +SE, n = 3 biological replicates). 
(D) Feeding preference of dsGFP- and ds/IRT1-fed WCR larvae (+SE, n = 19 to 
20 choice situations with five larvae each). (C and E) Fe content (+SE, n = 

4 biological replicates, with 3 to 5 larvae pooled per replicate) and growth (+SE, 
n = 20 biological replicates) of dsGFP- or dsIRT1-exposed larvae feeding on WT 
or bx1 roots with different Fe source treatments. Full time courses are shown in 
fig. S16. Asterisks indicate significant differences (*P < 0.05; **P < 0.01; ***P < 0,001). 


also act as resistance factors against different pests 
and diseases and shape the root microbiome 
(8, 14) but, as shown here, also increase the per- 
formance of a specialist herbivore. The diverse 
costs and benefits of the benzoxazinoid pathway 
for maize represent an optimization problem for 
plant breeding that may have contributed to the 
persistence of WCR as a damaging maize pest. 
Essential trace metals such as Fe influence 
herbivore performance and herbivore commu- 
nity composition (15-17). As trace metals are 
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often present as complexes in plants (2, 18), the 
ability of herbivores to detect and respond to 
these complexes may shape plant-herbivore inter- 
actions in agricultural and natural ecosystems. 
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Abrupt cloud clearing of marine 
stratocumulus in the subtropical 


southeast Atlantic 


Sandra E. Yuter’*, John D. Hader’”, Matthew A. Miller’, David B. Mechem® 


We document rapid and abrupt clearings of large portions of the subtropical marine low 
cloud deck that have implications for the global radiation balance and climate sensitivity. 
Over the southeast Atlantic, large areas of stratocumulus are quickly eroded, yielding partial 
or complete clearing along sharp transitions hundreds to thousands of kilometers in 

length that move westward at 8 to 12 meters per second and travel as far as 1000+ kilometers 
from the African coast. The westward-moving cloudiness reductions have an annual peak in 
occurrence in the period from April through June. The cloud erosion boundaries reduce cloud 
at =10-kilometer scale in less than 15 minutes, move approximately perpendicular to the 
mean flow, and are often accompanied by small-scale wave features. Observations suggest 
that the cloud erosion is caused by atmospheric gravity waves. 


he areal extent and temporal variability of 
subtropical marine low clouds strongly in- 
fluence the global radiation balance (J). 
Large, persistent areas of subtropical ma- 
rine stratocumulus clouds have been called 
Earth’s “climate refrigerator” (2). These low ma- 
rine clouds scatter solar radiation back to space 
and emit thermal radiation at a temperature close 
to the sea surface temperature (SST), yielding net 
radiative cooling of the climate system. The con- 
trols on areal coverage of marine stratocumulus 
have historically been cast either as a steady-state 
response to an imposed large-scale mean forcing 
(e.g., SST, subsidence, inversion strength, and 
radiative flux) (3, 4) or as being dominated by 
internal aerosol-cloud-precipitation processes 
(5-7). Most previous work on low marine cloud- 
iness transitions, including several studies address- 
ing pockets of open cells, has predominantly used 
frameworks that focus on responses to SST gra- 
dients and/or internal aerosol-cloud-precipitation 
processes and implicitly exclude external multi- 
day synoptic variability (8, 9). Yet a recent spec- 
tral analysis of multiyear global satellite datasets 
(0) found that cloudiness variance at multiday 
(3 to 50 days) time scales exceeds the seasonal 
variance over the subtropical southeast Atlantic 
and northeast Pacific and is only slightly lower 
than the seasonal magnitude over the subtrop- 
ical southeast Pacific. Abrupt changes in re- 
gional albedo from 0.9 (mostly low cloud cover) 
to 0.6 (ocean under mostly clear sky) will sub- 
stantially increase the shortwave radiation ab- 
sorbed by the ocean, which has pronounced 
implications for marine ecology and biogeo- 
chemistry (11). 
Cloudiness transition boundaries hundreds 
to thousands of kilometers long in the southeast 
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Atlantic exhibit abrupt reductions of cloud akin 
to pulling away a sun shade (Fig. 1 and movies S1 
to $4). The information below is derived from 
377 cloud erosion boundaries, identified from an 
examination of 1911 days over the period from 
8 May 2012 through 31 July 2017 by using mod- 
erate resolution imaging spectroradiometer 
(MODIS) corrected reflectance data and geo- 
synchronous satellite visible and infrared (IR) 
data (see methods for details). The cloud-eroding 
events in the southeast Atlantic occur year-round, 
with a peak monthly frequency of occurrence of 
roughly 20 events per month in May (Fig. 2). The 
abrupt cloudiness transitions, up to 1000+ km in 
meridional length, move westward at 8 to12ms7 
from 11°E to as far as 4°W longitude, as much as 
1500 km from the African coastline (Fig. 3). For 
any single cloudiness boundary, the speed of 
motion is relatively constant as the boundary 
moves westward (Fig. 3). At different locations 
along the same boundary, cloud reduction can 
result in partial or complete clearing (Fig. 1). 
Cloudiness transitions typically become discern- 
ible along the coast of southwestern Africa, gen- 
erally within a few hours of local midnight (Fig. 3 
and movies S2 and S4). The removal of cloud at 
night indicates that shortwave radiation-cloud 
feedbacks are not required. The prevailing winds 
at cloud level in this region are typically south- 
erly or southeasterly (72), and individual fea- 
tures in the low cloud field can be tracked moving 
northward relative to the westward-moving cloud 
erosion boundary (movie S1). The motion of the 
cloud erosion boundary is roughly perpendicular 
to the cloud-level winds and cannot be explained 
primarily by advection. 

High-resolution imagery of areas along the 
cloudiness boundaries often show a sharp and 
abrupt transition from overcast to clear or from 
overcast to broken clouds over spatial scales of 
<10 km (Fig. 1). Along some cloudiness bounda- 
ries, individual closed cells within the mesoscale 
cellular structure of the marine stratocumulus 


appear to be bisected (Fig. 1, F and G). Animated 
loops of visible satellite imagery reveal that 
cloud cover over ~10-km horizontal scales is 
removed or reduced in 15 min or less (movies 
S1 and S3). Near the cloud erosion boundaries, 
sets of wavelike, narrow (wavelength < 10 km), 
elongated, banded features roughly parallel 
to the cloudiness boundaries frequently occur 
(Fig. 1, E and G). 

Globally, satellite imagery reveals a rich array 
of multiday marine low-cloudiness variations, 
including sharp transitions between areas with 
nearly overcast and nearly clear conditions. Some 
of these cloudiness transition boundaries are as- 
sociated with the advection of low-level air from 
poleward extratropical cyclones into the subtropics 
(13-15). Cloud coverage, type, and height in the 
midlatitudes are clearly modulated by synoptic- 
scale baroclinic weather systems (J6-18). Some 
subtropical cloudiness reductions are associated 
with inversion strength anomalies originating 
from higher-latitude synoptic storms (10). Air- 
craft measurements off the coast of California 
show that near-shore clearings of the marine 
cloud field can be associated with synoptic-scale 
perturbations in the alignment and strength of 
the northeast Pacific ridge and concurrent meso- 
scale circulations along the California coast (19). 
These near-shore clearings expand during the 
day and contract toward the coast at night. 

Propagating atmospheric gravity waves (buoy- 
ancy oscillations) can manifest as transitions 
from overcast to broken cloud conditions and 
from thicker to thinner clouds. A westward- 
moving diurnal atmospheric gravity wave origi- 
nating over the heated, elevated terrain of the 
Andes modulates the liquid water path (LWP) 
in marine low clouds over the southeast Pacific 
(20-22). Intermittent atmospheric gravity wave 
trains originating from a disturbed subtropical 
jet can move equatorward into the southeast 
Pacific subtropical cloud deck, modulate the 
LWP, and reduce the cloud fraction during the 
day (23, 24). 

The characteristics of the southeast Atlantic 
cloud clearings are inconsistent with advection 
and the specific gravity-wave mechanisms doc- 
umented in the southeast Pacific. The westward 
motion of the southeast Atlantic cloudiness tran- 
sitions in an environment of prevailing south- 
erly and southeasterly cloud-level winds and a 
persistent, large-scale stable layer topped by an 
inversion that can serve as a waveguide (25) 
strongly suggest that the cloud erosion is caused 
primarily by an atmospheric gravity wave rather 
than advection. Additionally, the combination 
of the direction of motion, the phase speed, 
and the timing characteristics of these wave- 
like phenomena over the subtropical south- 
east Atlantic differs from that of documented 
atmospheric gravity waves over the subtropical 
southeast Pacific. The westward-moving diur- 
nal solitary wave over the southeast Pacific is 
associated with heating of elevated terrain and 
crosses the coast at about 1700 local time (20, 27). 
This westward-propagating atmospheric grav- 
ity wave has a wavelength of ~400 km, a phase 
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speed of 30 ms7’, and a wave depth of 5 km. The 
northeastward-moving atmospheric gravity wave 
trains that emerge from a disturbed subtropical 
jet and propagate toward the coast of Peru 
(23, 24) have wavelengths of 50 to 100 km, a ver- 
tical displacement of ~400 m, and a phase speed 
of ~15 ms’. The wavelength and vertical dis- 
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placement of the >1000-km-long southeast At- 
lantic cloud-eroding waves are currently not 
known, though the phase speed is in the range 
of 8 to 12 ms’' and evidence of waves with 
wavelengths of <10 km can be observed con- 
comitantly with the cloudiness transitions (Fig. 1, 
E and G). 
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Throughout the year in the southeast Atlantic, 
we also observed small-scale propagating cloud 
wave trains moving toward the south, southwest, 
southeast, and east. Figure 4 and movies S5 and 
S6 show an example when overlapping cloud wave 
trains moved southeastward, southwestward, and 
eastward coincident with westward-moving cloud 


Fig. 1. Examples of westward-moving cloudiness transitions in the 
southeast Atlantic off the coast of Africa in MODIS corrected reflec- 
tance imagery. Consecutive pairs of Terra (=10:30 a.m.) and Aqua 
(=1:30 p.m.) images (A and B) on 25 May 2013 and (C and D) on 
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26 May 2014. (E to G) Boxed areas in (B) to (D) shown at greater 

magnification. Latitude and longitude grids are shown with dashed 
lines. Movie loop animations of the 26 May 2014 case are shown in 
movies Sl (reflectance) and S2 (IR). 
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Fig. 2. Histogram of the daily probability 
of an abrupt cloudiness transition 

event occurring in a given month in the 
southeast Atlantic for the period from 

8 May 2012 through 31 July 2017. For 
example, the 0.658 daily probability value 
for the “Yes” events category for May 
means that on average 20.4 of the 31 days 
of May have westward-moving cloud 
erosion boundaries over the southeast 
Atlantic. The analysis was performed on the 
basis of pairs of Terra (morning) and Aqua 
(afternoon) visible corrected reflectance 
images. See methods for details. 
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Fig. 3. HovmGller plots illustrating cloud-clearing event timing and propagation speed. 

(A) Meteosat 600-nm visible imagery and (B) 12-um IR brightness temperature from OO UTC 
on 24 May 2014 to OO UTC on 31 May 2014. The dashed lines denote the approximate sunrise and 
sunset times. Data are averaged meridionally over a box from 6°W to 11°E longitude and 12.5°S 


to 17.5°S latitude. DD, day; HH, hour. 


erosion. Cloud wave trains associated with deep 
convective latent heating were documented off 
the northwest coast of Australia during seasons 
when deep convection was common over the 
nearby continent (26). A potential source for 
southward-moving atmospheric gravity waves 
in the southeast Atlantic is latent heating from 
tropical deep convection (27) over western equa- 
torial Africa and from the Atlantic Intertropical 
Convergence Zone. The eastward-moving wave 
trains in the southeast Atlantic may originate 
from the disturbed subtropical jet, similar to 
cloudiness perturbations seen in the southeast 
Pacific (23). 

Cloud generation by atmospheric gravity waves 
is extensively documented in the literature, and 
the associated physical mechanisms are reason- 
ably well understood (26, 28-32) (see supplemen- 


Yuter et al., Science 361, 697-701 (2018) 


17 August 2018 


tary text for details). These studies show that 
when moisture and stability conditions are favor- 
able, upward motions within the waves can yield 
propagating lines of clouds hundreds of kilome- 
ters in length moving on the order of 10 ms. 
By contrast, cloud-eroding waves defy easy ex- 
planation. The upward and downward motion 
associated with atmospheric gravity wave pas- 
sage should yield reversible changes. For a parcel 
with a given temperature and specific humidity, 
upward motion decreases the temperature, in- 
creases relative humidity, and, if saturation occurs, 
can yield cloud. Downward motion increases 
temperature and decreases relative humidity. If 
the moisture content of the air is unchanged, 
cloudiness would increase with upward motion, 
decrease with downward motion, and return to 
the prewave cloud state once the wave passes 


through. Yet observations show that cloud erodes 
along the boundaries in an irreversible manner 
that yields a reduced cloud fraction for hours 
(movies S2 and S4). 

For atmospheric gravity wave-induced cloud 
clearing to occur, a wave must be excited and 
then must propagate through a layer of the at- 
mosphere that includes the top of the marine 
boundary layer where the clouds reside. Further, 
the wave needs to be associated with a localized 
mechanism that can irreversibly reduce the cloud 
fraction on time scales of tens of minutes. Lastly, 
the cloud deck itself must be susceptible to this 
erosion by the atmospheric gravity wave. 

Why abrupt cloud clearing over the subtrop- 
ical southeast Atlantic is most frequent in May is 
related to some combination of environmental 
conditions favoring the presence of the above- 
mentioned factors in the region and the season 
of interest compared with other seasons and 
regions. MODIS aerosol optical depth measure- 
ments show that abrupt cloudiness transitions 
occur in a variety of aerosol conditions (fig. S1). 
Comparison of large-scale conditions derived 
from the Modern-Era Retrospective Analysis for 
Research and Applications (MERRA) for May, 
the month with the most cloud erosion bounda- 
ries, and for January, the month with the fewest, 
indicates weaker stability in May than in January 
(a median estimated inversion strength of 4.3 K 
versus 5.1 K) but stronger subsidence (700-hPa 
vertical pressure velocity of 0.049 Pa s' versus 
0.032 Pas’). 

Our hypothesis is that westward-moving at- 
mospheric gravity waves cause the abrupt cloud- 
iness transitions in the southeast Atlantic and 
are triggered by the interaction of offshore flow— 
likely combining a nocturnal land breeze and 
downslope winds from the coastal highlands 
(33)—with the stable marine boundary layer in a 
manner similar to processes that generate cloud- 
forming atmospheric gravity waves (26, 28, 30, 31). 
For individual cloud erosion boundaries, we typ- 
ically do not see evidence of slowing with in- 
creasing distance from the coast, as would be 
expected from a thinning, dissipating cold-air 
mass originating over land (Fig. 3). Among the 
three subtropical marine stratocumulus cloud 
decks in the northeast Pacific, southeast Pacific, 
and southeast Atlantic, the deck in the southeast 
Atlantic has the largest diurnal wind variability 
near the adjacent coast (34), which would be 
consistent with regular offshore flow events 
triggering gravity waves. Comparable westward- 
moving abrupt cloud-clearing events are not 
common in either the southeast Pacific or the 
northeast Pacific. 

The removal of cloud over tens of minutes and 
the observation of bisected closed cellular cloud 
structures suggest a fast-acting mechanism. We 
hypothesize that cloud erosion is a consequence 
of rapid entrainment of warm and dry air from 
the free troposphere into the cloud layer by en- 
hanced turbulence associated with a solitary wave 
train excited by offshore flow emanating from 
southwest Africa. Locally enhanced turbulent 
kinetic energy and mixing across the inversion 
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Fig. 4. Examples of multiple small-scale cloud wave trains propagating southeastward, 
southwestward, and eastward over the southeast Atlantic. MODIS corrected reflectance 
images centered off the west coast of Africa on 12 May 2015 during the afternoon Aqua overpass 
are shown. (A) Regional image with 2° grid lines. (B and C) Close-ups of waves overlaid with 

1° grid lines corresponding to the inset boxes in (A). Movie loop animations of this case are shown 


in movies S5 (reflectance) and S6 (IR). 


have been documented within a solitary wave 
packet in the central United States (35). 
Targeted observations in the subtropical south- 
east Atlantic, including the use of dropsondes 
and airborne radar and lidar measurements, will 
be needed to resolve the mechanisms for rapid 
cloud erosion along cloud-eroding boundaries. The 
meteorological conditions associated with the for- 
mation and lack of formation of the cloudiness 
boundaries, as well as features of downslope winds 
and their interactions with the marine stable layer, 
are currently being investigated by combining 
observations, reanalysis, and mesoscale modeling. 
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Persistent, wide-area reductions in the cloud 
fraction over hundreds of kilometers associ- 
ated with these westward-moving cloud erosion 
boundaries contribute to a lower March through 
May average cloud fraction in the subtropical 
southeast Atlantic than in any season in the 
subtropical southeast Pacific or the subtropical 
northeast Pacific (36). Mechanisms that yield 
substantial multiday variability in the marine 
stratocumulus cloud fraction are highly relevant 
to the climate system. Cloud-system resilience 
to external multiday perturbations and the fre- 
quency of these perturbations may be key factors 
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in governing low-cloud variability and potential 
climate sensitivity. 
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RNA METABOLISM 


Mixed tailing by TENT4A 
and TENT4B shields mRNA from 
rapid deadenylation 


Jaechul Lim”?*+, Dongwan Kim"*?*, Young-suk Lee”?*, Minju Ha”, Mihye Lee”, 
Jinah Yeo”, Hyeshik Chang””, Jaewon Song”, Kwangseog Ahn’, V. Narry Kim”?+ 


RNA tails play integral roles in the regulation of messenger RNA (mRNA) translation 

and decay. Guanylation of the poly(A) tail was discovered recently, yet the enzymology and 
function remain obscure. Here we identify TENT4A (PAPD7) and TENT4B (PAPDS) as the 
enzymes responsible for mRNA guanylation. Purified TENT4 proteins generate a mixed poly(A) 
tail with intermittent non-adenosine residues, the most common of which is guanosine. A 
single guanosine residue is sufficient to impede the deadenylase CCR4-NOT complex, which 
trims the tail and exposes guanosine at the 3’ end. Consistently, depletion of TENT4A and 
TENT4B leads to a decrease in MRNA half-life and abundance in cells. Thus, TENT4A and 
TENT4B produce a mixed tail that shields mRNA from rapid deadenylation. Our study unveils 
the role of mixed tailing and expands the complexity of posttranscriptional gene regulation. 


NA tailing (nontemplated nucleotide addi- 

tion to the 3’ end of RNA) is one of the 

most frequent RNA modifications (J, 2). 

We previously developed a method called 

TAIL-seq that reveals the 3’ end sequences 
of the transcriptome (3). TAIL-seq allowed us to 
investigate the mechanism and function of non- 
canonical mRNA tailing, such as U tails. Uridylation, 
particularly oligo-uridylation, occurs selectively 
on the 3’ ends of deadenylated mRNAs and pro- 
motes mRNA decay (4-6). 

Notably, we discovered guanylation at the 3’ 
termini of mRNAs in HeLa cells, expanding the 
complexity of mRNA tails (3). Unlike U tails, gua- 
nylated tails were found mainly among long A 
tails [>~25 nucleotides (nt)] and were detected 
in a form of single-nucleotide addition (Fig. 1A). 
Guanosine residues were located mainly at the 
3' terminal or penultimate positions (Fig. 1B). 
Terminal uridine and cytidine residues were 
found on long A tails, albeit less frequently. Non- 
adenosine residues in other species—such as 
mouse, frog, fish, and fly—suggest a conserved 
modification mechanism (fig. S1A). 

Tailing is catalyzed by a group of template- 
independent nucleotidyl transferases (7). Apart 
from the canonical poly(A) polymerases (PAPs) 
that cotranscriptionally generate mRNA poly(A) 
tails, vertebrates have multiple noncanonical 
PAPs with distinct substrate preferences (fig. 
S1B). Because some noncanonical PAPs catalyze 
uridylation instead of adenylation, noncanoni- 
cal PAPs are also called terminal nucleotidyl- 
transferases (TENTs) (8, 9). To identify the 
enzyme(s) responsible for guanylation, we con- 
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ducted TAIL-seq after combinatorial knock- 
downs of the TENTs. Global guanylation frequency 
was reduced in TENT4A/4B-depleted (i.e., cells 
depleted of TENT4A and TENT48B), as well as in 
TENT1/4A/4B/6-depleted cells (fig. S1, C to E) (5). 
Individual knockdown of TENT4A or TENT4B 
did not affect guanylation (Fig. 2A), suggest- 
ing that these enzymes may act redundantly. 
Transcript-level analysis indicated that TENT4A 
and TENT4B may target a substantial fraction 
of mRNA species (656 out of 901, 72.8%) (Fig. 2B 
and table S1). A comparable result was ob- 
tained with a different set of small interfering 
RNAs (siRNAs) (fig. S1, E and F). Overexpression 
resulted in an increase of guanylation (Fig. 2C 
and fig. S1G), indicating that both TENT4A and 
TENT4B may catalyze guanylation. Of note, the 
previous annotation of TENT4A and TENT4B 
lacked conserved N-terminal extensions that are 
critical for their activity (fig. S2) (10, 11). 
TENT4A (TUT5/PAPD7/hTRF4-1/POLS) and 
TENT4B (TUT3/PAPD5/hTRF4-2/GLD4) are ho- 
mologs of yeast Trf4 (72). Trf4 was previously 
shown to catalyze guanylation, but the report has 


A | B 


been overlooked owing to the predominant ade- 
nylation activity of this protein (73). We found 
in vitro that TENT4B incorporates adenosine 
triphosphate (ATP) efficiently but also other nu- 
cleotides, with a preference for guanosine tri- 
phosphate (GTP) over uridine triphosphate (UTP) 
and cytidine triphosphate (CTP) (Fig. 2D). In vitro 
incorporation assay with TENT4B and nucleoside 
triphosphate (NTP) mixtures revealed that G resi- 
dues were incorporated with a frequency of 15.5%, 
more than that of U (5.7%) and C (5.2%) residues 
(Fig. 2E). Consistently, uridylation and cytidyla- 
tion frequencies decreased in TENT4A/B-depleted 
cells (fig. S1, H and I). Notably, non-adenosines 
were distributed without any positional pref- 
erence at the 3’ terminus (Fig. 2E, bottom right). 
Similar results were obtained with TENT4A 
(Fig. 2E, right), indicating that TENT4A and 
TENT4B generate a mixed tail with a preference 
for guanosine at irregular positions. 

This in vitro result is seemingly inconsistent 
with our observation from TAIL-seq experiments, 
which showed a clear positional preference at 
the 3’ end of poly(A) tails (Fig. 1B). To understand 
this discrepancy, we investigated whether addi- 
tional factors are involved in tail remodeling. 
Deadenylases are 3'-5’ exoribonucleases that 
favor poly(A) sequences (14-18). We hypothe- 
sized that deadenylase(s) may trim down the 
mixed tail and stall at non-A residues, thereby 
exposing non-A at the 3’ terminus. The CCR4- 
NOT (CNOT) complex is the main cytoplasmic 
deadenylase complex and has two catalytic sub- 
units: a Cafl homolog (CAFla/CNOT7 or CAFIb/ 
CNOTS8) and a Ccr4: homolog (CCR4a/CNOT6 or 
CCR4b/CNOT6L) (19, 20). When immunopurified 
CNOT7 proteins were incubated with synthetic 
poly(A) RNAs with intermittent Gs (A50G) (Fig. 3A), 
a ladderlike pattern was detected at the penul- 
timate position (Fig. 3B, orange arrowheads). 
The same assay with catalytic mutants showed 
no ladderlike accumulation (fig. S3A). CNOT6L 
sheared the adenosine sequences but stalled at 
each G residue, yielding G at the 3’ terminus 
(Fig. 3C, red arrowheads). Similarly, C and U 
residues were able to expose the non-adenosine 
residues (fig. S3B). Thus, the CNOT complex trims 
the mixed tail to generate a poly(A) tail with a 
terminal or penultimate non-adenosine residue. 
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These results also indicate that even a single A 


non-adenosine residue can counteract deadenyl- 
ation and stabilize mRNA. Indeed, in vitro de- 
adenylation was substantially delayed in the 
presence of a single terminal G (Fig. 3D, left, 
and fig. $3, C and D). Likewise, 3’ penultimate 
or internal G was sufficient to impede dead- 
enylation (Fig. 3D, right, and fig. S3, D and E). 
Consistently, the poly(A) tail length of mRNA 
was reduced in TENT4A/B-depleted cells (fig. 
S4A). Transcripts with statistically high guanyl- 
ation were affected more strongly than those 
with low guanylation (fig. S4B), supporting our 
hypothesis that guanylation delays deadenylation. 

To further examine whether TENT4A and 
TENT4B have a positive impact on mRNA sta- 
bility, we performed RNA sequencing in TENT4A/B- 
depleted cells. A modest transcriptome-wide 
decrease in mRNA levels was observed (Fig. 4A 
and fig. S4C and table S2). Transcripts with statis- 
tically high guanylation were reduced compared 
with those that have low guanylation (Fig. 4B and 
table S3). Another siRNA set produced similar re- 
sults (fig. S4D). mRNAs with frequent mixed tail- 
ing (including G, U, and C) were also destabilized 
(fig. S4E), consistent with our in vitro dead- 
enylation experiments with non-adenosine res- 
idues (fig. S3B). Highly guanylated genes were 
destabilized in TENT4A/B-depleted cells after 
actinomycin D treatment (Fig. 4C), indicating 
that TENT4A/B-mediated mixed tailing may 
stabilize mRNAs. 

Notably, transcripts encoding secreted proteins 
are frequently guanylated, suggesting an asso- 
ciation between mixed tailing and endoplasmic 
reticulum-mediated translation (table S3). Con- 
versely, many genes involved in protein synthesis 
(such as ribosomal proteins and translation fac- 
tors) are not guanylated frequently, indicating 
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Fig. 2. TENT4A and TENT4B are responsible enzymes for guanylation. (A) Decrease in guanylation 
after knockdown of TENT4A and TENT4B. (B) Transcriptome-wide decrease in guanylation after 


TENT4A and TENT4B knockdown. GPM (guanylated tags per million) was calculated for each transcript. 
Transcript-level GPMs >3 before or after knockdown of TENT4A and TENT4B were illustrated as a 


two-dimensional density plot. Cont, control. (C) Increase in guanylation upon overexpression of TENT4A 
and TENT4B. (D) In vitro incorporation assay with individual NTP and FLAG-TENT4B. (E) In vitro 
incorporation assays with mixed NTPs showed intermittent guanylation. (Left) In vitro assays with 
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S4F), suggesting that the specificity of TENT4A 
and TENT4B may reside in the 3’'UTR. Our motif 
search did not reveal a particular motif enriched 
in the UTR, possibly because multiple elements 
may contribute in combination. TENT4B was 
previously reported to adenylate and destabilize 
miR-21 (27), but neither an increase in miR-21 
level nor a significant change in microRNA 
(miRNA) targets was found in TENT4A/B-depleted 
cells (fig. S5). TENT4A/B homolog Trf4 is a com- 
ponent of the TRAMP (Trf4/Air1/2/Mtr4) com- 
plex (22-26), but guanylation was not reduced in 
MTR4-depleted cells (fig. S6). The results suggest 
that TENT4A and TENT4B regulate mRNA levels 


guanosine at the 3’ end. 
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distances (in nucleotides) 
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Red and orange arrowheads 
denote the 3’ end of G and 
the subsequent A (GA), 
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FLAG-CNOT7. (C) In vitro 
deadenylation assay with 
FLAG-CNOTEL. Asterisks 
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and SEM error bars (n = 3). 
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Fig. 4. Mixed tailing by TENT4A and TENT4B stabilizes mRNA. 

(A) Decrease in steady-state MRNA levels after knockdown of TENT4A 

and TENT4B knockdown (n = 2). The dashed horizontal line indicates the 
median fold change. Labeled mRNAs were validated by quantitative reverse 
transcription polymerase chain reaction (qRT-PCR) (fig. S4C). TPM, 
transcripts per million. (B) Upon knockdown of TENT4A and TENT4B, mRNA 
levels of highly guanylated genes (n = 56; P < 0.01) decreased more than 
those of genes with low guanylation (n = 31; P < 0.01). “Background” refers 


were found with TENT4B-tethered reporters (fig. 
87, C and D). Thus, the non-adenosine incorpo- 
ration of TENT4A and TENT4B may be important 
for mRNA stabilization, although this experiment 
does not rule out the possible contribution of 
adenylation. Of note, the same enzymology and 
function were observed in human foreskin fibro- 
blast (HFF) cells (fig. S8). Growth defects were 
found in TENT4A/B-depleted HeLa and HFF cells 
(fig. S9), hinting at the physiological importance 
of TENT4/B-dependent mRNA tailing (27-29). 
The mRNA tail is thought to be a pure stretch 
of As, with little informational content except for 
its length. In this study, we demonstrate that 
TENT4A and TENT48B generate specialized tails 
that protect mRNAs from active deadenylation 
(Fig. 4E). The protective effect may be attributed 
to both adenylation and mixed tailing. We fo- 
cused on the impact of non-adenosines added by 
TENT4A and TENT4B and discovered the con- 
siderable stalling effect. The wide range of mRNA 
destabilization in TENT4A/B-depleted cells opens 
the possibility of multiple mixed tailing events 
during the mRNA life span. The “not-so-pure” 
poly(A) tail is constantly remodeled by tailing 
enzymes and exoribonucleases and acts as an 
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additional layer for posttranscriptional regula- 
tion, in effect changing the way we think about 
mRNA tails. 
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IMMUNOLOGY 


DNA-induced liquid phase 
condensation of cGAS activates 
innate immune signaling 


Mingjian Du’? and Zhijian J. Chen””’?* 


The binding of DNA to cyclic GUP-AMP synthase (cGAS) leads to the production of the 
secondary messenger cyclic GUP—AMP (cGAMP), which activates innate immune 
responses. We have shown that DNA binding to cGAS robustly induced the formation of 
liquidlike droplets in which cGAS was activated. The disordered and positively charged 
cGAS N terminus enhanced cGAS-DNA phase separation by increasing the valencies 

of DNA binding. Long DNA was more efficient in promoting cGAS liquid phase separation 
and cGAS enzyme activity than short DNA. Moreover, free zinc ions enhanced cGAS 
enzyme activity both in vitro and in cells by promoting cGAS-DNA phase separation. These 
results demonstrated that the DNA-induced phase transition of cGAS promotes cGAMP 


production and innate immune signaling. 


yclic GMP-AMP synthase (cGAS) is a DNA- 

sensing enzyme that catalyzes the conver- 

sion of GTP and ATP to cyclic GMP-AMP 

(cGAMP), which activates the adaptor 

protein STING (J, 2). This, in turn, induces 
type I interferons and other cytokines (3-5). DNA 
arising in the cytoplasm activates cGAS and drives 
the formation of cytoplasmic foci containing cGAS 
and DNA (J). However, the molecular mechanism 
and functional effect of such cGAS foci are poorly 
understood. cGAS contains a disordered and 
positively charged N terminus and a structured 
C terminus harboring a nucleotidyltransferase 
domain (core cGAS). Both the N and C termini 
of cGAS bind to DNA irrespective of DNA se- 
quence (6-11). We hypothesized that such multi- 
valent interactions could lead to the formation 
of large membraneless protein foci through liquid 
phase separation (Fig. 1A). This physicochemical 
process has emerged as a key mechanism under- 
lying the formation of cellular bodies such as P 
granules and nucleoli (12-20). 

To test whether DNA binding induces phase 
separation of cGAS in vitro, we incubated fluo- 
rescently labeled cGAS protein (fig. SIA) with 
double-stranded DNA (dsDNA) oligonucleotides 
[100 base pairs (bp)] (table S1 and methods). Upon 
mixing, cGAS and DNA formed micrometer-sized 
liquid droplets within 2 min (Fig. 1B and movie 
$1). Small liquid droplets fused into larger ones 
(Fig. 1C), accompanied by increased fluorescence 
intensity and larger equivalent diameter (Fig. 1, 
D and E, and movie S1). Fluorescence recovery 
after photobleaching (FRAP) experiments showed 
that when bleaching was performed 30 min after 
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the initiation of phase separation, the fluores- 
cence of cGAS or DNA was efficiently recovered 
in a temperature-dependent manner (Fig. 1F and 
fig. S1, B to D). In contrast, when bleaching oc- 
curred 2 hours after mixing of cGAS and DNA, 
the fluorescence recovery was much slower (Fig. 
1F and fig. S1D). Thus, cGAS and DNA molecules 
within the liquid droplets are mobile and exhibit 
dynamic internal rearrangement in the early phase 
but gradually undergo a liquid-to-solid transition 
and mature into a gel-like state (Fig. IF) (19, 21). 

cGAS and DNA formed liquid droplets when 
the concentration of each exceeded 30 nM (Fig. 
1G) in a buffer mimicking physiological ion con- 
centrations and the composition of the cytoplasm 
(physiological buffer) (methods). Measurements 
of cGAMP production by cGAS revealed that 
the specific activity of cGAS substantially in- 
creased at concentrations above 30 nM (Fig. 1H 
and methods). The in vitro cGAS-DNA phase 
transition was weakened by increasing salt (NaCl) 
concentrations (fig. S2A), suggesting that ionic 
interactions between cGAS and DNA were im- 
portant for the phase transition. A 45-bp dsDNA 
commonly used for the stimulation of the cGAS 
pathway [immune stimulatory DNA (ISD)] also 
robustly induced cGAS liquid droplet formation 
(fig. S2, B to D). This effect was abolished by 
treatment with Benzonase, which degrades DNA 
(fig. S2E). cGAS also formed liquid droplets with 
45-bp double-stranded RNA (fig. S2F), but RNA 
did not activate cGAS to produce cGAMP (fig. 
S2G). These results are consistent with previous 
structural studies indicating that DNA but not 
RNA binding induces a conformational change 
that activates cGAS (7). Thus, liquid phase separa- 
tion is insufficient for cGAS activation in the 
absence of the correct conformational change 
induced by DNA. 

The cGAS-DNA phase separation was unaf- 
fected by the addition of ATP, GTP, or a com- 
bination thereof (fig. S3A). Moreover, ATP or 
GTP could be partitioned into and enriched in 


the cGAS-DNA liquid droplets (fig. S3, B and C). 
FRAP experiments showed that ATP was rapidly 
exchanged into and out of the cGAS-DNA droplets 
(fig. S3D). 

We next examined the formation of cGAS- 
DNA foci within cells. In the human fibroblast 
cell line BJ-5ta stably expressing a Halo-tagged 
cGAS, cGAS formed puncta with fluorescein- 
labeled ISD in the cytoplasm (Fig. 2A and fig. 
S4A). To confirm that cGAS formed large granules 
with DNA in the cytoplasm, we used cGAS- 
deficient mouse embryonic fibroblast (MEF) 
cells reconstituted with green fluorescent pro- 
tein (GFP)-cGAS, transfected the cells with Cy5- 
ISD, and then permeabilized the cells with 
saponin (22). cGAS formed puncta with ISD and 
remained in the cytoplasm after saponin treat- 
ment (Fig. 2B). The cGAS-DNA foci exhibited 
liquidlike properties, as demonstrated by the 
ability of two foci to fuse with each other (Fig. 
2C). Furthermore, upon photobleaching, cGAS 
in the foci displayed near-complete fluorescence 
recovery within 120 s (Fig. 2, D and E), indicating 
that cGAS exhibits dynamic liquidlike behavior 
within cellular granules. 

To determine the functional consequence of 
cGAS liquid droplet formation, we transfected 
HeLa cells with herring testis DNA (HT-DNA) 
and measured cGAS activity in subcellular frac- 
tions (fig. S4B). Most cGAS activity was present 
in the pellet obtained after centrifugation at 
2,000 x g (designated P2), which contains mainly 
the nuclei and heavy particles (Fig. 2F). Further 
separation of the P2 fraction by iodixanol (OptiPrep) 
density gradient ultracentrifugation revealed two 
distinct pools of cGAS activity. The first pool was 
in very heavy fractions (20 to 25% iodixanol) 
(Fig. 2G), which were separated from the nuclei 
(27.5 to 30% iodixanol). Thus, cGAS appeared 
to form heavy particles with transfected DNA 
that were distinct from cellular organelles and 
vesicles and that contained active cGAS. The 
second pool of cGAS activity was in fractions con- 
taining the nuclei. However, it remains to be 
determined whether this activity came from 
cGAS within the nuclei or some cGAS particles 
that cosedimented with the nuclei. Similar re- 
sults were also obtained with human monocytic 
THP-1 cells (fig. S4, C and D). 

Multivalent interactions drive liquid phase se- 
paration (16). Long DNA has more binding sites 
(higher valency) for cGAS than short DNA, and 
full-length cGAS (FL-cGAS) has higher valency 
for DNA than core cGAS (Fig. 3A) (6, 7, 11). To 
test whether cGAS-DNA liquid phase separation 
is driven by the valency of cGAS and DNA inter- 
actions, we incubated FL-cGAS and N-terminally 
truncated cGAS (AN146-cGAS, where the numeral 
indicates the number of amino acids deleted) 
with DNA of different lengths in the physiological 
buffer (5 mM NaCl and 135 mM KCl) or a buffer 
containing 300 mM NaCl. Human FL-cGAS formed 
more numerous and larger liquid droplets with 
longer DNA (Fig. 3B). Both human and mouse 
FL-cGASs exhibited stronger phase separation 
than N-terminally truncated cGAS with DNA 
of the same length either in the physiological 
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Fig. 1. DNA binding to cGAS induces the formation of liquidlike droplets 
in which cGAS is activated. (A) Schematic of hypothetical CGAS-DNA 
interactions that drive liquid phase condensation. (B) Time-lapse imaging of 
cGAS-DNA phase separation. Liquid droplets formed after mixing of 10 uM 
human FL-cGAS (3% was labeled with Alexa 488) with 10 uM 100-bp DNA 
(2% was labeled with Cy3) and matured over 60 min. The images shown are 
representative of all fields in the well. (C) Time-lapse micrographs of merging 
droplets that formed as described in the legend to (B). Data are representative of 
at least 10 merging droplets. (D) Fluorescence intensities of CGAS-DNA liquid 
droplets forming over the time course of 120 min. Data were normalized to 
100% by maximal fluorescence intensity. Values shown are means + SD. n = 4 
images. AF488, Alexa Fluor 488. (E) EqDiameter (the diameter of a circle with the 
same area as the measured object) frequency distribution of CGAS-DNA liquid 
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droplets formed at the indicated time points. cCGAS, 5 uM; DNA, 5 uM. (F) FRAP of 
cGAS-DNA liquid droplets. Bleaching was performed at the indicated time 
points after cCGAS (10 uM) and DNA (10 uM) were mixed, and the recovery 

was allowed to occur at 25° or 37°C. Time O indicates the start of recovery after 
photobleaching. Shown are the means + SD. n = 3 liquid droplets. n.s., not 
significant (P > 0.0332); ***P < 0.0002; ****P < 0.0001 [one-way analysis of 
variance (ANOVA)]. (G) Phase separation diagram of human FL-cGAS and 
100-bp DNA at the indicated concentrations. (H) (Left) cGAMP production by 
the indicated concentrations of cGAS in the presence of ATP, GTP, and 

HT-DNA. cGAMP production at low cGAS concentrations is shown in the inset. 
(Right) Normalized cGAMP production divided by cGAS concentrations. Shown 
are the means + SD. n = 3 assays. Data are representative of at least three 
independent experiments unless indicated to be otherwise. 
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Fig. 2. DNA-induced liquid phase separation of cGAS in cells. 
A) Representative live-cell images of cGAS-DNA puncta formation 
after BJ cells stably expressing Halo-cGAS, which was covalently 
abeled with tetramethyl rhodamine, were transfected with 
fluorescein-ISD. Insets are zoomed images showing cGAS-DNA 
puncta. Scale bars, 10 um. These images are representative 

of at least 10 cells. (B) MEF cells stably expressing GFP-cGAS were 
transfected with Cy5-ISD for 4 hours, after which they were 
permeabilized with saponin and analyzed by fluorescence 
microscopy. Shown in blue is the plasma membrane marker 
wheat germ agglutinin. Arrows indicate puncta. Scale bars, 50 um. 
These images are representative of at least five fields examined. 
(C) Time-lapse micrographs of cGAS (green) and DNA (magenta) 
puncta formation and fusion (time O represents 30 min 

after transfection with Cy5-45-bp ISD). Arrows indicate puncta. 
Scale bar, 15 um. The fusion events existed in all eight fields 
examined. (D) Representative micrographs of cGAS-DNA 

puncta before and after photobleaching (arrow, bleach site). 

Scale bar, 15 um. These images are representative of at least 
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three cells in which the cGAS-DNA puncta were photobleached. 

(E) Quantification of FRAP of cGAS-DNA puncta over a 120-s time 
course. K, exponential constant; R, normalized plateau after 
fluorescence recovery. Shown are means + SD. n = 3 cGAS-DNA puncta. 
(F) Subcellular fractionation of cGAS activity in cells transfected 
with DNA. HeLa cells transfected with HT-DNA or untransfected 
HeLa cells were fractionated by differential centrifugation as 
depicted in fig. S4B. Fractions were incubated with ATP and GTP, 
after which cGAMP was measured. Fractions were also analyzed 

by immunoblotting (IB) with antibodies against histone H2A 
(nuclear marker), glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) (cytoplasmic marker), or human cGAS (hcGAS). 

(G) The P2 fractions from (F) were further separated by OptiPrep 
gradient ultracentrifugation, and cGAS activity in different 
fractions was measured as for (F). Fractions from cells not 
transfected with DNA had no cGAS activity (upper panel). 

Error bars in (F) and (G) represent the variation range of duplicate 
assays. Data are representative of at least three independent 
experiments. 
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buffer (Fig. 3C) or at 300 mM NaCl (fig. S5C). The 
enzymatic activity of FL-cGAS in the presence of 
HT-DNA was stronger than that of AN146-cGAS 
in both low-salt buffer (Fig. 3D) and physiolog- 
ical buffer (Fig. 3E). 

To investigate the role of the cGAS N terminus 
in cells, we reconstituted cGAS-deficient MEF 
cells with human FL-cGAS or AN160-cGAS (fig. 
S5F). After transfection with Cy5-ISD, FL-cGAS 
formed puncta with Cy5-ISD in the MEF cells, 
whereas AN160-cGAS formed fewer puncta (Fig. 
3F and fig. $6). cGAMP production was also 
higher in cells stably expressing FL-cGAS than in 
those expressing AN160-cGAS upon transfection 
with 45-bp ISD or HT-DNA (Fig. 3G; cGAS ex- 
pression levels are shown in fig. S5F). 

cGAS enzyme activity was much weaker in the 
assay with physiological buffer than in the assay 
with low-salt buffer (fig. S5, D and E). This raises 
the question of how cGAS is activated in cells. We 
found that zinc ions substantially promoted the 
activity of recombinant cGAS in the physiological 
buffer and that this enhancement could be par- 
tially replaced by other ions, such as Mn** or Co** 
(Fig. 4A and fig. S7A), which is a general char- 
acteristic of enzymes that require zinc (23). Sim- 
ilarly, Zn?* was the most efficient in activating 


(Increasing Valency) 


OQOOOOO®D, 45 bp 


mouse cGAS (fig. S7, B and C). The optimal con- 
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centrations of Zn?* were 160 to 625 uM (Fig. 4A 
and fig. S7C), which is within the physiological 
concentration range of zinc ions in cells (24, 25). 
Zn?* at ~200 uM markedly facilitated DNA- 
induced cGAS phase separation at low concen- 
trations of cGAS and DNA (Fig. 4B and fig. S7D). 
Moreover, cGAS activity at low concentrations 
was markedly enhanced in the presence of Zn* 
(Fig. 4C). At a concentration as low as 2.5 nM, 
which is below the concentration of cGAS in the 
cytoplasm of HeLa cells (approximately 8 to 
12 nM) (methods), cGAS underwent phase transi- 
tion and catalyzed cGAMP synthesis (Fig. 4, B 
and C). At 10 nM cGAS, 100-bp DNA activated 
cGAS with a median effective concentration (ECs0) 
of ~1.4. nM, whereas 45-bp DNA had much weaker 
activity. DNA at 25 bp or shorter had no detect- 
able activity in the in vitro assay (Fig. 4D). This 
DNA length-dependent activation of cGAS was 
largely mirrored in the cellular assay in which 
a THP-1 reporter cell line was transfected with 
DNA (Fig. 4E). 

Using a thermal shift assay, we found that 
DNA binding destabilized cGAS but that Zn?* 
stabilized the cGAS-DNA complex (Fig. 4F and 
fig. S8A). Measurements of free Zn?* concentra- 
tions revealed that Zn?* bound to cGAS but not 
to DNA (Fig. 4G). To determine whether zinc 


Fig. 3. Multivalent interactions drive 
cGAS-DNA condensation and promote 
cGAS activation. (A) Schematic of 
hypothetical cGAS and DNA valencies. 

(B) Representative images of phase 
separation by mixing of cGAS (10 uM) with 
dsDNA of different lengths (10 uM) in 
physiological buffer. Scale bar, 10 um. 

(C) Bright-field images of phase separation 
by mixing of DNA of different lengths 

with full-length or N-terminally truncated 
human or mouse cGAS as indicated. Scale 
bar, 20 um. The images shown in (B) and 
(C) are representative of all fields in the 
wells. (D and E) cGAMP production by 
different concentrations of recombinant 
human FL-cGAS or N-terminally truncated 
cGAS in low-salt buffer (D) or physiological 
buffer (E). Shown are the means + SD. 

n = 3 assays. (F) Quantification of 
cGAS-DNA puncta by imaging of MEF 
cells expressing GFP-tagged human 
FL-cGAS or AN160-cGAS after transfection 
with Cy5-ISD. Representative images 

are shown in fig. S6. Values shown are 
means + SD. n = 5 images. (G) cGAMP 
production in the MEF cells expressing 
human FL-cGAS or AN160-cGAS after 
transfection with ISD or HT-DNA. Values are 
means + SD. n = 3. (F) and (G): ****P < 
0.0001 (multiple t tests). cGAS expression 
levels are shown in fig. SSF. Data are 
representative of at least three 
independent experiments. 


plays a role in cGAS activation within cells, we 
depleted L929 cells of zinc with the zinc-specific 
chelator N,N,N',N’-tetrakis(2-pyridinylmethyl)-1,2- 
ethanediamine (TPEN). Cellular cGAMP production 
upon transfection with HT-DNA was decreased 
in a TPEN concentration-dependent manner (Fig. 
4H and fig. S8B). Under these conditions, TPEN 
did not affect the viability of L929 cells (fig. S8C). 
Live-cell imaging with a zinc-specific fluorescent 
probe revealed that cGAS-DNA puncta contained 
zinc (fig. S8, D and E). These results showed that 
zinc facilitated cGAS activation in cells by promot- 
ing cGAS phase transition in the presence of 
cytosolic DNA. 

Thus, DNA binding to cGAS induces a robust 
phase transition to liquidlike droplets, which func- 
tion as microreactors in which the enzyme and 
reactants are concentrated to greatly enhance 
the production of cCGAMP. This mechanism allows 
cGAS to detect the presence of DNA in the cyto- 
plasm above a certain threshold to trigger a 
switchlike response. Such a switchlike response 
is made possible by the multivalent interactions 
between the DNA binding domains of cGAS 
and DNA in a manner that depends on the DNA 
length. This also provides an explanation for 
why long DNA activates cGAS more efficiently. 
The binding between cGAS and DNA involves 
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Fig. 4. Zinc ions promote DNA-induced phase 
separation and activation of cGAS. (A) Zn** 
enhances cGAS activation in vitro. Recombinant 
human FL-cGAS (15 nM) was incubated with 
ATP, GTP, and DNA in a physiological buffer 
containing the indicated concentrations of Zn", 
and cGAMP production was measured. 

(B) Quantification of cGAS-DNA condensates 

in the presence or absence of zinc. Liquid-phase 
condensates formed after mixing of Alexa 

Fluor 488-labeled human FL-cGAS with 45-bp 
Cy3-labeled ISD at the indicated concentrations 
of each in physiological buffer with or without 
Zn** (200 uM). Images were then captured by 
confocal microscopy, and representative images 
are shown in fig. S7D. Values are means + SD. 
n= 5 images. P values are from multiple t tests. 
(C) cGAMP production in physiological buffer 
containing HT-DNA and different concentrations 
of cGAS in the presence or absence of Zn** 
(200 uM). The activity of cGAS at low 
concentrations is shown in the inset. P values 
are from multiple t tests. (D) cGAMP 
production by 10 nM cGAS in physiological 
buffer containing 200 uM Zn** and different 
concentrations of DNA of the indicated lengths. 
(E) THP-1-Lucia ISG cells, which harbor a 
luciferase gene under the ISG54 promoter, were 
transfected with the indicated DNA for 24 hours, 
after which the secreted luciferase activity 

was measured. RLU, relative luciferase units. 
(F) Thermal shift assay to measure the stability 
of cGAS or the cGAS-DNA complex in the 
presence or absence of Zn** (200 uM). Tm. 
protein melting temperature. Values are 

means + SD. n = 3. P values are from an 
unpaired t test. (G) Measurement of cGAS 


binding to zinc. Zinc ions (10 uM) were incubated with various concen- 
trations of DNA, cGAS, or both, and the solution was passed through a 
centrifugal filter, after which the zinc ion concentration in the filtrate was 
measured. The dissociation constant (Kg) values for zinc binding to 
cGAS and the cGAS-DNA complex were 3.9 + 1.3 uM and 3.0 + 0.4 uM, 
respectively. (H) Depletion of intracellular zinc inhibits cGAS activation by 
DNA. L929 cells were incubated with the indicated concentrations 


extensive ionic interactions between the positively 
charged surfaces of cGAS and negatively charged 
DNA. Such interactions are vulnerable to cyto- 
plasmic salt concentrations, which may be a 
mechanism to prevent spurious activation of 
cGAS by self-DNA below a certain threshold. 
However, we found that zinc ions could substan- 
tially enhance cGAS phase separation and its 
enzymatic activation at physiological salt con- 
centrations. Free zinc ions are stored mainly in 
organelles such as the mitochondria and the 
endoplasmic reticulum (26), and their delivery 
to the cytosol may be another avenue by which 
cGAS activity is regulated in cells. DNA binding 
to cGAS induced formation of cGAS-DNA con- 
densates, which were observed as cytoplasmic 
foci within cells. Further characterization of the 
dynamics and composition of the cGAS con- 
densates should provide deeper insights into the 
mechanism by which cGAS activity is tightly reg- 
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ulated to trigger an appropriate immune response 
to pathogens while simultaneously avoiding auto- 
immune reactions to self-tissues. 
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Fragile X mental retardation 1 gene 
enhances the translation of large 
autism-related proteins 


Ethan J. Greenblatt and Allan C. Spradling* 


Mutations in the fragile X mental retardation 1 gene (FMR1) cause the most common 
inherited human autism spectrum disorder. FMR1 influences messenger RNA (mRNA) 
translation, but identifying functional targets has been difficult. We analyzed quiescent 
Drosophila oocytes, which, like neural synapses, depend heavily on translating 

stored mRNA. Ribosome profiling revealed that FMR1 enhances rather than represses 
the translation of mRNAs that overlap previously identified FMR1 targets, and acts 
preferentially on large proteins. Human homologs of at least 20 targets are associated 
with dominant intellectual disability, and 30 others with recessive neurodevelopmental 
dysfunction. Stored oocytes lacking FMR1 usually generate embryos with severe 

neural defects, unlike stored wild-type oocytes, which suggests that translation of multiple 
large proteins by stored mRNAs is defective in fragile X syndrome and possibly other 


autism spectrum disorders. 


MR1is a polysome-associated RNA binding 

protein required for the nervous system 

and ovary to develop and function normally 

in humans, mice, and Drosophila (J, 2). Both 

tissues translationally control stored mRNAs 
associated with FMR1-containing ribonucleo- 
protein particles (RNPs) (3-6), which suggests 
that FMRI has a specific function in using stored 
mRNAs. However, the challenge of obtaining 
highly enriched FMR1-containing RNPs from 
neural tissue may have contributed to difficulties 
in defining FMR1 target genes (7-9). We reasoned 
that FMR1 function could be studied in a physio- 
logically relevant context using mature Drosophila 
oocytes, which lack transcription and depend 
entirely on ongoing translation. 

Drosophila oogenesis is highly amenable to 
such studies because each female can hold up 
to 30 mature oocytes per ovary for several weeks 
(Fig. 1A). In the ovary, each oocyte is surrounded 
by 800 somatic cells constituting a follicle. We 
found a reliable method (see supplementary mate- 
rials) to maintain completed follicles in the ovary 
for a known period of time in the absence of new 
follicle maturation (Fig. 1B). Ovulation and fer- 
tilization could then be induced by adding males 
to test the stored oocyte’s ability to support 
embryogenesis. 

We tested the function of specific genes during 
oocyte storage by depleting their transcripts with 
germline-specific GAL4-driven RNA interference 
(RNAi), which is produced throughout oogenesis 
starting in the germline stem cell. Disrupting a 
gene required during oocyte storage would cause 
oocytes to at first develop normally, and then to 
lose developmental capacity more rapidly than 
the wild type during further storage. To screen 


Howard Hughes Medical Institute Research Laboratories, 
Department of Embryology, Carnegie Institution for Science, 
Baltimore, MD 21218, USA. 

*Corresponding author. Email: spradling@ciwemb.edu 


Greenblatt et al., Science 361, 709-712 (2018) 


for such genes, we depleted Fmrl mRNA or sev- 
eral other candidate gene transcripts and analyzed 
oocytes after a storage period of 1 day or 10 days 
(Fig. 1C). In most gene knockdown lines and in 
wild-type controls, the hatch rate was nearly 
100% after 1 day of storage, and after 10 days 
only dropped to 80%. In contrast, Fmri-depleted 
oocytes hatched normally at first, but after 10 days 
of storage, only 20 to 30% of them completed 
development (Fig. 1C). Germline Fmr RNAi dras- 
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tically reduced Fmr] mRNA levels, and antibody 
staining confirmed that FMRI protein was effec- 
tively depleted throughout oogenesis specifically in 
germ cells but not in somatic cells (fig. S1). 

We validated the Fmri requirement in stored 
oocytes by showing that oocyte viability drops 
continuously over time without Fmri (Fig. 1D). 
The defects were specific to the old oocytes them- 
selves. Refeeding the mother caused the remain- 
ing stored oocytes to be laid and new oocytes to 
mature, which were fully functional (Fig. 1D). 
Thus, germline Fmr] RNAi does not observably 
impair stem cells, follicle development, or any 
nonautonomous aspects of female germline func- 
tion. This differs from FmrI mutants, which lose 
stem cells and produce fewer follicles because 
mutants affect niche cells and other somatic cells 
in addition to germ cells (JO). 

We analyzed embryos derived from control 
and FMRI-deficient oocytes to investigate what 
processes are affected by FMRI depletion. Em- 
bryos from control oocytes developed a normal 
nervous system regardless of prior storage, as 
shown by staining with broadly expressed neural 
markers (Fig. 2, A and B). The same was true of 
embryos derived from Fmr1 RNAi oocytes after 
1 day of storage (Fig. 2A). However, in embryos 
derived from follicles lacking FMR1 during 10 days 
of storage, ventral nerve cord-specific labeling 
showed missing commissures and breaks in the 
longitudinal connectives (Fig. 2C), in contrast to 
the wild type. These neuronal defects were not 
due to a generalized deterioration of the FMR1- 
depleted oocyte’s ability to support embryonic 
development (fig. S2). We never observed com- 
parable neural defects in embryos derived from 
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Fig. 1. Fmr1 is specifically required during the storage of mature, quiescent stage 14 oocytes 
in the ovary. (A) Schematic of a Drosophila ovariole with immature precheckpoint follicles and two 
stored mature stage 14 follicles. (B) Arrested mature follicle stability (red) following feeding protocol 
as described (see supplementary materials). (©) Fmrl knockdown (lines 1 and 2), but not 
knockdowns of controls or other indicated genes, specifically reduces 10-day stored (red) but not 
1-day stored (black) oocytes from developing into hatching larvae. (D) Fmr1 germline RNAi 

during storage progressively reduces the fraction of mature oocytes competent after 1, 4, 7, or 

10 days of storage to support development. Refeeding females to promote maturation of fresh stage 
8 follicles restores full developmental potential. Error bars in (B) and (D) denote SD; n.s., not 


significant. GFP, green fluorescent protein. 
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wild-type oocytes, even when they were stored 
for 14 days when fewer than 20% of their em- 
bryos developed to hatching (Fig. 2D). In contrast, 
more than 50% of embryos derived from FMRI1- 
deficient oocytes stored in the ovary for 10 days 
developed a severely abnormal nervous system 
(Fig. 2E). Thus, disrupting FMR1 function while 
oocytes are fully dependent on translational regu- 
lation specifically compromised their ability to 
support neural development relative to controls. 

To determine how FMRI disruption affects 
oocyte translation and to identify FMRI trans- 
lational target genes that may be important for 
sustaining nervous system development, we de- 
veloped a ribosome-profiling protocol based on 
(1D) (see supplementary materials) to quantify 
oocyte translation in an unbiased, genome-wide 
manner. Flies induced for oocyte storage were 
analyzed after only 1 or 2 days to identify initial 
translational changes in healthy Fmri knockdown 
oocytes prior to viability reduction. Although 
whole ovaries were needed to obtain enough 
material for profiling, most of the analyzed 
ribosomes should still derive from stored stage 
14 follicles, because they are larger than the total 
size of earlier follicles. Interestingly, the ribosome 
footprints and transcript levels of most mRNAs 
were unaffected by germline Fmr] knockdown 
(Fig. 3A); this result indicates that FMR1 does not 
generally control translation or mRNA stability. 

From 11 independent, highly reproducible ribo- 
some profiling experiments (fig. S3), we identified 
421 genes whose germline expression signifi- 
cantly declined and 14 genes that significantly 
increased expression in FmrI RNAi (Fig. 3, B 
and C, and tables S1 and S82). Except for Fmri 
itself, translation of the significantly altered tar- 
gets generally decreased by a factor of 1.3 to 2.5, 
which we verified by Western blotting (fig. S4), 
whereas their RNA levels were unchanged (Fig. 3B). 
Many down-regulated genes, at least 56 of 421, 
are orthologs of human genes that have been im- 
plicated in human neurodevelopmental syn- 
dromes (Fig. 3C and table S3)—a proportion 
greater than expected by chance (P = 1.1 x 10°; 
fig. S5A). For example, the neurofibromatosis 
gene Nf1, which is associated with cognitive and 
behavioral disorders and neural tumors (12), was 
reduced by a factor of 2.5. Several E3 ubiquitin 
ligases, including CTRIP/TRIP12, POE/UBR4, and 
HUWE1, whose human homologs are associated 
with intellectual disability, autism, early-onset de- 
mentia, and schizophrenia (13-15), were reduced 
by about a factor of 2. In total, homologs of at 
least 20 dominant autism/intellectual disability 
genes were significantly reduced (table S3). Be- 
cause mutations in these genes are dominant 
(16), a factor of 2 reduction in expression has po- 
tential consequences, even for a single target. 

To determine whether FMRI acted on target 
transcripts through direct binding, we compared 
our candidate FMR! targets with a previous report, 
which used proximity-based strategies to cross- 
link mRNAs in brain tissue before immuno- 
precipitation of FMR1 (7). We found significant 
overlap between both datasets (P = 1.2 x 10°; 
fig. S5B), which suggests that FMR1 directly 
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Fig. 2. Stored FMR1-depleted 
oocytes frequently generate 
embryos with neural defects. 

(A) Control oocytes stored in vivo 
for 1 day or 10 days support normal 
embryonic nervous system devel- 
opment (22C10 antibody). Fmr1 
RNAi oocytes stored in vivo for 

1 day support normal development 
but after storage for 10 days pro- 
duce highly abnormal nervous 
system development. (B) Normal 
ventral nerve cord (BP102 anti- 
body) from embryo developed from 
10-day control oocyte. (C) Abnor- 
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binds many affected transcripts. However, our 
results differed from prior studies in two impor- 
tant ways. First, the great majority of targets 
decreased in expression, indicating that FMR1 
usually enhances rather than represses transla- 
tion, in contrast to most (17, 18) but not all (9, 19) 
previous reports. This difference might arise be- 
cause multiple FMR1 targets act negatively on 
protein stability, translation, or cell growth, in- 
cluding nine ubiquitin ligases, Nfl, and Not1 
(table S1). Down-regulation of these negative 
regulators after Fmr1 loss might substantially 
increase protein levels, simulating the direct ef- 
fects of a translational repressor. 

Second, almost all of the affected proteins are 
much larger in size than the average Drosophila 
protein. Dividing mRNAs undergoing transla- 
tion into size classes showed that FMR1 strik- 
ingly affects translation in proportion to protein 
size (Fig. 3, C and D) and to some extent UTR 
length (fig. S6A), but not by transcript level 
(fig. S6D). The translation of nearly half (46%) 
of expressed proteins longer than 2000 amino 
acids, 13% of proteins 750 to 1000 amino acids 
in length, but only 1% of proteins shorter than 
250 amino acids was significantly reduced with 
Fmr!i knockdown (Fig. 3C). Fmri1 RNAi did not 
always impair the translation of large proteins, 
as there was a broad response to Fmri knockdown 
(Fig. 3C) and the translation of many large pro- 
teins was only minimally affected. FMR1 targets 
had low translation efficiencies (TEs) in oocytes 
(Fig. 3D), similar to long transcripts generally 
(fig. S6B). However, FMR] boosted the translation 
of affected long transcripts regardless of TE 
(Fig. 3E), implicating size and not low TE as the 
predominant factor. The size effect was not due 
to reduced processivity, because we observed a 
uniform reduction in footprints across the entire 
coding sequence of target mRNAs (Fig. 3F). The 
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mal ventral nerve cords including 
broken or fused connectives 
(arrows) from three embryos 
developed from 10-day Fmr1 RNAi 
oocytes. (D) Normal nerve cord 
from a control oocyte stored for 
14 days. (E) Summary of nervous 
system development in embryos 
from control (GFP) and Fmrl RNAi 
oocytes (N = 50). All scale bars, 20 um. 


preferential effect on large mRNAs is likely me- 
diated by direct FMR1 binding, because the 
average size of target proteins common to both 
this study and (7) was 1841 amino acids. Further- 
more, genes linked to autism as a group are sig- 
nificantly longer than average (16, 20). 

The Poe/Ubr4 gene, encoding one of the lon- 
gest Drosophila proteins (5322 amino acids), was 
investigated as an FMR1 target with potentially 
large effects, something not previously identified. 
Both Poe and FmrI mutant Drosophila are male 
sterile, cannot fly, and show increased neuro- 
muscular junction synaptic excitability (27-23). 
Stored Poe mutant oocytes lost developmental 
competence at the same rate as Fmri germline 
RNAi oocytes (Fig. 4A), and these embryos also 
developed a high frequency of neural defects 
(Fig. 4B and fig. S7A). Beginning in nearly ma- 
ture follicles, POE protein formed 0.5- to 2-um 
spherical particles in germ cells (Fig. 4C), which 
were distinct from RNP granules containing the 
P body marker TRAL (fig. S8). POE antibody 
staining was lost in Poe germline RNAi and Poe 
mutants (Fig. 4D). As predicted, POE protein 
levels were reduced and particles were reduced or 
eliminated in Fmri1 germline RNAi and in Fmri- 
null egg chambers (Fig. 4D). These observations 
demonstrate that Poe is a major functional 
target of FMRI, and that POE is itself essential 
to maintain the oocyte’s ability to support neural 
development. However, overexpression of POE 
using a duplication in an FmrI RNAi background 
restored POE granule expression (Fig. 4D) but 
did not rescue Fmr] RNAi lethality upon storage 
(fig. S7F). 

FMRY1’s function in maintaining an oocyte’s 
ability to support neural development may only 
be revealed during oocyte storage because some 
FMRI targets act catalytically. For example, E3 
ligases whose levels are reduced in FMR1-deficient 
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oocytes might no longer be able to prevent the 
overaccumulation of some of their target proteins, 
which might eventually reach levels in the stored 
oocytes that interfere with neural development. 
FMRI regulates translation both in the ovary 
and during neural development as part of RNP 
granules that may either stimulate or repress 
translation. Studies of these granules suggest a 


A 


potential explanation for the size effect we ob- 
served on target proteins. Mammalian and yeast 
mRNAs that are large and/or inefficiently trans- 
lated preferentially associate with stress granules 
where their translation is repressed (24). FMR1 
may function on specific targets to counteract the 
inherent general tendency of large mRNAs to be 
segregated into inactive RNP particles. Alterna- 


tively, FMR1 might directly or indirectly promote 
translation initiation in association with RNPs, 
or it might affect mRNA transport along micro- 
tubules to sites of active initiation. 

We propose that maintaining the translation 
of large mRNAs in the translational environments 
experienced far from the nucleus at synapses in 
many neurons represents a general challenge that 


Ribosome 
Footprinting 
=0.95 | 


mRNA-seq 


mino Acids 


Fmr1 RNAi 
log,, TPM 
Gene 


Fmri 682 
Nf1 2765 
Ana3 1978 
Huwet1 5147 
Fry 3545 

=e Poe 5323 
Ctrip 3141 
05=5 IntS1 2054 
Bet 25 CG15099 2600 
CG14967 2301 
Nup188 1867 
Rme-8 2408 
CG16974 1258 
Shtd 2031 
ProSAP 1872 
Faf 2762 

Epg5 2456 
Ulp1 1514 
GramD1B 1207 
Not1 2504 


Fmr1 RNAi 
log,, TPM 
o-NWA YW 


345 012345 
Control RNAi 
log,, TPM 


01 2 
Control RNAi 
log,, TPM 


# Amino Acids 


Cumulative 9 
Frequency 


Fold-change in Translation 
log, Fmr1 RNAi / Control 


# Amino Acids: All Transcripts 


> 2000 1000-2000 
Fig. 3. FMR1 stimulates translation during storage of transcripts from 


32101 3-2-1017 327071 

a TE 
multiple intellectual disability and autism genes. (A) Translational 
profile [logio(TPM), transcripts per million] and mRNA abundance profile 
[mMRNA-seq, logio(TPM)] are highly similar between control and Fmr1 RNAi 
oocytes (stored 1 to 2 days). (B) Top genes translationally reduced by 
Fmr1 RNAi from 11 ribosome-footprinting experiments do not show 
significant changes in MRNA levels. (C) Significance versus relative (fold) 
change plot reveals 421 candidate targets translationally stimulated by 


Cumulative ™ 
Frequency 


A 


OreR / yw 
mGherry RNAi 
Poe™®9 / + 


Fmr1 RNAi 


a 
N Oo 
oa oO 


Stage 8 


Poe?1659 
Poe 420 


Hatch Rate (%) 
3 


4 7 10 
Gasvie Age (Days) 


B 8 100 a Normal 
> 
c Neural 
wu =O Stage 8 Stage 10B 
xo WT Poe WT_ Poe 
e Day 1 Day1 Day8 Day8 


Fig. 4. Poe is required for oocyte storage and neural development. 

(A) Poe mutation accelerates oocyte decline during storage. Error bars 
denote SD. (B) Poe mutant oocytes frequently fail to support normal neural 
development after prolonged storage. (©) POE antibody staining 

(see supplementary materials) during follicle development, showing 


Greenblatt et al., Science 361, 709-712 (2018) 17 August 2018 


TPM Fold Change 
Fmr1 RNAi/Control: 
mRNA-seq 

—_ Ribosome Footprinting 


* 0 05 


(End of follicle growth) 


a 


11.5 


Fold-change in Translation 
log, Fmr1 RNAi / Control 


Control RNAi 
Fmr1 RNAi 


7 


Huwe1 


salt 
oft 


) weet 4 8 1216 s alten 8 1216 ELE 
Nucleotide Position (kB) 


Normalized 
Read Depth 


01234 


FMRI1 (P < 0.01, t test). Protein size class is indicated by color. 

(D) Cumulative plot of translation (Fmr1 RNAi/control) as a function of 
protein size (left) or translational efficiency (TE, right) defined as 
ribosome-footprinting TPM(Fmrl RNAi)/mRNA-seq TPM. (E) Translation 
of large mRNAs in Fmr1 RNAi versus controls is reduced independent 

of TE. (F) Normalized read depth is plotted for two FMRI targets (Poe and 
Huwel) and two nontargets (Orb and Top2). In Fmr1 RNAi oocytes, 
target gene footprints are reduced at all positions along the mRNA. 


D 


Stage 10B 


WT... . ai Poe RNAi —_ Poe? 
Fmrié Emr RNAi Qe 
Fmri RNAi___ Fmrt°— __4 |poe OE = 


germline granules that arise in maturing follicles. Scale bars, 3 um. 

(D) Many POE granules are seen in wild-type stage 10 follicles but not 
in Poe RNAi, Poe®!®°?, Fmr1 RNAi, or null Fmr1/4°° follicles. Fmr1 RNAi 
combined with POE overexpression (OE) recovers POE granules. 

Scale bars, 20 um. 


3 of 4 


810z ‘9} IsnBny uo /6i0'beweoualos’eoue!0S//:d}]y Wo.) pepeojuMOGg 


RESEARCH | REPORT 


not only underlies fragile X syndrome but is 
relevant to other neurodevelopmental condi- 
tions. The same challenges likely exist in oocytes, 
spermatocytes, and non-neural somatic cells that 
require regulated translation from stored mRNAs. 
Because the challenges of translating large pro- 
teins are likely to increase in adult neurons under 
the influence of aging, the pathways and targets 
assayed here may contribute to adult-onset neural 
impairments such as schizophrenia and dementia. 
Improved knowledge of how FMRI1 preserves 
target translation, and of the identities of major 
target genes such as Poe/Ubr4, may open new 
opportunities to monitor susceptible cells and to 
intervene to mitigate declining levels of the most 
critical targets. Small-molecule agents that coun- 
teract the tendency of large mRNAs to be segre- 
gated into inactive granules represent potentially 
valuable therapeutics. Continued study of these 
highly conserved pathways in Drosophila repre- 
sents one powerful and efficient means to further 
address both the fundamental and applied im- 
plications of these findings. 
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vides a clean reload solution with easy one-handed loading. Tip Eject’s 
rigid plastic covering shields tips from the laboratory environment from 
the top down, keeping contamination out. It comes with clear casing and 
colored snap cards for easy tip identification. TipEject takes up one-fifth 
the bench space of a pack of racked tips. The package is recyclable; by 
reusing the base, plastic use is reduced by up to 44%. When you've got 
the need for speed and only one free hand, Tip Eject is the solution. 
Biotix 

For info: 888-330-2396 

biotix.com 


Gene Expression Panels 

nCounter Gene Expression Panels are developed in collaboration 
with leading researchers to deliver the most valuable, relevant panel 
content across applications, providing a highly multiplexed, simple 
workflow—no library prep or dilutions required, and just 15 min 
total hands-on time. Address key challenges in immuno-oncology by 
measuring multianalyte expression levels simultaneously within the 
tumor, the immune system, and the tumor microenvironment all on 
one platform, with NanoString’s 3D Biology Technology. nCounter 
Immunology, Inflammation, and Myeloid Panels are all designed 

to evaluate biomarkers associated with autoimmune diseases and 
can be customized with up to 30 additional user-defined genes. 
Furthermore, nCounter Immunology Panels are ideal for assessing 
infectious disease immune response and for pathogen identification. 
NanoString Technologies 

For info: 888-358-6266 

www.nanostring.com 


LIFE SCIENCE TECHNOLOGIES 


new products 


Solid-Phase Microextraction Tips 

SPE-it Tips are solid-phase microextraction devices configured to 
permit isolation of analytes from aqueous solutions followed by direct 
analysis without further sample processing. The fiber tips are solid- 
phase microextraction fibers embedded in a micropipette tip for easy 
handling. The fiber coating is composed of either C18 or PDMS/DVB 
bonded silica particles embedded in a biocompatible polymer. These 
tips can be manipulated manually or by robotic equipment, and are 
shipped in a 96-well format tray. The DART SPE-it kit includes a box 

of 96 SPE-it fibers (C18 or PDMS/DVB), a deep-well plate shaker, two 
SPE-it holders (each holds 12 fibers), a module for use with dynamic 
automated range technology (DART) automation hardware, an SPE-it 
Kit Place Mat with instructions, and 5 x 96 well plates provided with 
the initial kit. Additional fibers can be purchased separately in eight- 
pack units. 

lonSense 

For info: 781-484-1043 
ionsense.com/products/consumables/spe_it_tips 


Field Fluorometer 

The V-Lux Fluorometer is configured to provide high-quality in 

situ detection of algae, aromatic hydrocarbons, or tryptophan- 

like fluorescence. It includes three fluorescence channels as well 

as absorption, turbidity, and temperature channels, which allow 
corrections from these potential interferences. Applications include 
monitoring of road and airport apron runoff, bathing waters, and 
shellfish waters, plus discharge monitoring within the oil and gas 
sector. The fluorometer is packaged within a small, 50-mm diameter 
housing of 158-mm length, is rated to 6,000 m, and has integrated 
antibiofouling protection. It comes with an internal logger and 
provides real-time data with a choice of data output protocols 
including Modbus, SDI-12, and other digital formats, and includes 
quality control channels. It can be used as a handheld instrument, 
as part of a flow-through system, or deployed from gliders or 
underwater vehicles. 

Chelsea Technologies Group 

For info: +44-(0)-20-8481-9000 

www.chelsea.co.uk 


Surface Analysis System 

Materials researchers, product developers, and failure-analysis 
scientists seeking to conduct cost-effective, research-level X-ray 
photoelectron spectroscopy (XPS) studies can now benefit from 

the Thermo Scientific Nexsa surface analysis system. The system 

is designed to easily integrate multiple analytical techniques ina 
single, compact, fully automated surface analysis instrument. Nexsa 
combines the high throughput and high sensitivity of the K-Alpha 
XPS system with the multitechnique capabilities of the ESCALAB Xi* 
XPS microprobe. Nexsa users can add complementary techniques, 
such as Raman spectroscopy, ion scattering spectroscopy, 
reflected electron energy loss spectroscopy, and UV photoelectron 
spectroscopy, to generate multiple measurements from the same 
point on the sample. Integration of multiple techniques allows 
users to conduct true correlative analysis, unlocking the potential 
for further advances in microelectronics, ultrathin films, and 
nanomaterials development. 

Thermo Fisher Scientific 

For info: 800-532-4752 

www.thermofisher.com 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/about/new-products-section for more information. 


Newly offered instrumentation, apparatus, and laboratory materials of interest to researchers in all disciplines in academic, industrial, and governmental organizations are featured in this 
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Laser-Based Micropipette Puller 
DOES YOUR LAB ANALYZE © Pull quartz, borosilicate, and aluminosilicate glass 
THE MECHANISMS THAT IRAN a 


© Fully programmable, including heating filament characteristics 


MEDIATE COMMUNICATION © Pulls electrodes with tips less than 0.03 pm 


BETWEEN CELLS? © Optimized velocity sensing circuit 


© Long-life CO, laser 

© Write-protected for 
inadvertent changes 

© Date/time stamp and 
total heat time displayed 

© P-2000/F is ideal for 
nanospray and NSOM 


The P-2000 integrates a CO, laser-based heat source with the 
technology derived from our extensive experience with conventional 
pullers. This system offers capabilities unmatched by other pullers. 
A significant advance in the technology of fabrication 
ofmicropipettes, optical fiber probes, and nanospray tips, is 

offered with the P-2000 micropipette puller. 
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Find out more about the scope of the journal and submit PHONE: +1.415.883.0128 | FAX: +1.415.883.0572 
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SCIENCE.ORG 


Be a Force for Science 


TAKE ACTION 


Understand the science Learn ways you can 


behind the issues that Champion public become an advocate and 


matter. discussion and evidence- stand up for science. 


based policy. 


Present your work to the world 


Are you a representative of the upcoming generation of thought leaders 
in your field? Together we look forward to your application for the new 
Sartorius & Science Prize for Regenerative Medicine & Cell Therapy. 


Apply now! 

www.passionforscience.com/prize The Sartorius & Science 
Prize for Regenerative 
Medicine & Cell Therapy 


Awarded by sartorius 


online @sciencecareers.org 
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SCIENCE CAREERS 
ADVERTISING 


For full advertising details, 
go to ScienceCareers.org 
and click For Employers, 
or call one of our 
representatives. 


Science 


AMERICAS 


+1 202 326-6577 
+1 202 326-6578 
advertise@sciencecareers.org 


EUROPE, INDIA, AUSTRALIA, 
NEW ZEALAND, REST OF 
WORLD 


+44 (0) 1223 326527 
advertise@sciencecareers.org 


CHINA, KOREA, SINGAPORE, 
TAIWAN, THAILAND 


+86 131 4114 0012 
advertise@sciencecareers.org 


JAPAN 


+81 3-6459-4174 
advertise@sciencecareers.org 


CUSTOMER SERVICE 


AMERICAS 

+1 202 326-6577 
REST OF WORLD 
+44 (0) 1223 326528 


advertise@sciencecareers.org 


All ads submitted for publication must comply with 
applicable U.S. and non-U.S. laws. Science reserves 
the right to refuse any advertisement at its sole 
discretion for any reason, including without limitation 
for offensive language or inappropriate content, 

and all advertising is subject to publisher approval. 
Science encourages our readers to alert us to any ads 
that they feel may be discriminatory or offensive. 
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ScienceCareers.org 


RADCLIFFE INSTITUTE 
FOR ADVANCED STUDY 
HARVARD UNIVERSITY 


Academic Fellowships 


The Radcliffe Institute Fellowship Program 
at Harvard University welcomes fellowship 
applications in natural sciences and 
mathematics. The Radcliffe Institute for 
Advanced Study provides scientists the time 
and space to pursue their career’s best work. 
At Radcliffe you will have the opportunity 

to challenge yourself, meet and explore the 
work of colleagues in other fields, and take 
advantage of Harvard's many resources, 
including the extensive library system. The 
Radcliffe Institute Fellowship Program invites 
applications from people of all genders, 

and from all countries. We seek to build a 
diverse fellowship program. Scientists in any 
field who have a doctorate in the area of the 
proposed project (by December 2017) and 
at least two published articles or 
monographs are eligible to apply for a 
Radcliffe Institute fellowship. The stipend 
amount of $77,500 is meant to complement 
sabbatical leave salaries of faculty 
members. Residence in the Boston area and 
participation in the Institute community are 
required during the fellowship year. 


Applications for 2019-2020 are due by 
October 4, 2018. 

For more information, please visit 
www.radcliffe.harvard.edu 

or email 
sciencefellowships@radcliffe.harvard.edu. 


Search more 
jobs online 


Access hundreds 
of job postings on 
ScienceCareers.org. 


Expand your 
search today. 


Science 
Careers . 


AVAAAS 


Advance your career 


with expert advice from 


Science Careers. 


(3) Download Free Career Advice Booklets! 


ScienceCareers.org/booklets 


Featured Topics: 

= Networking 

= Industry or Academia 
= Job Searching 

= Non-Bench Careers 

= And More 
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OPPORTUNITIES IN BIOMEDICAL RESEARCH 
¢ Cancer Genetics 

¢ Cell Signaling & Regulation 

¢ Chemistry & Chemical Biology 

¢ Computational Biology 

¢ Developmental Biology 

¢ Experimental Therapeutics 

¢ Genome Integrity, Gene Expression, and Epigenetics 
¢ Immunology 

¢ Stem Cell Biology & Regenerative Medicine 

¢ Structural Biology 


JOIN OUR FACULTY 


Successful candidates will hold an appointment in one of SKI’s research programs. 
Candidates may apply to up to two programs: 


Chair: Scott Lowe, PhD Chair: John Petrini, PhD 
Chair: Dana Pe’er, PhD 

Chair: Kristian Helin, PhD Chair: David Scheinberg, MD, PhD 
Chair: Kathryn Anderson, PhD 

Chair: Derek Tan, PhD Chair: Nikola Pavletich, PhD 


Chair: Alexander Rudensky, PhD 


RESEARCH AND TRAINING 


¢ 130 research laboratories housed in state-of-the-art buildings 
¢ Core facilities offering cutting-edge scientific services 
* Over 800 pre- and post-doctoral trainees 


¢ Appointments in the Gerstner Sloan Kettering Graduate School of Biomedical 
Sciences and the Weill Cornell Graduate School of Medical Sciences 


Visit to learn more. 


Application deadline: | £ eS 
Apply at: ae Tay. 


MSK is an equal opportunity and affirmative action employer committed to diversity and inclusion in all aspects of recruiting and employment. 

All qualified individuals are encouraged to apply and will receive consideration without regard to race, color, gender, gender identity or expression, 
sexual orientation, national origin, age, religion, creed, disability, veteran status or any other factor which cannot lawfully be used as a basis for an 
employment decision. 


online @sciencecareers.org 
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Yale 


UNIVERSITY 


Department of Genetics 
Yale University School of Medicine 


Professor of Genetics and 
Director of the Center for Genomic Health 


The Department of Genetics at Yale University School of Medicine 
is searching for a Professor of Genetics with an outstanding record 
of transformative scientific achievements in Human Genetics 
and Genomics. We expect that the candidate will lead a vigorous 
cross-disciplinary research program focused on identifying and 
characterizing genetic drivers of human disease. As a leader of human 
genetics both within the department and across the Yale School of 
Medicine, the successful candidate will have the opportunity to 
recruit other human geneticists to the Genetics Department and lead 
a new program in precision medicine as the Scientific Director of the 
Yale Center for Personalized Medicine and Genomic Health. 


We are looking for a dynamic, internationally recognized scientist 
(Ph.D., M.D., or M.D./Ph.D.) with an outstanding research record of 
scientific discoveries, as well as a strong track record of training the 
best innovators in the field of human genetics and genomics. 


To apply, please submit your CV to http://apply.interfolio. 
com/45539 to the attention of Antonio Giraldez, Chair of Genetics. 
Applications will be reviewed until the position is filled. Inquiries 
should be addressed to neltja.brewster@yale.edu. 


Tenure-Track Faculty Position 
(Assistant/Associate/Full Professor) 


The Department of Genetics at Yale University School of Medicine 
invites applications for junior or senior tenure-track faculty positions. 
The search is open to investigators from all areas of biological and 
biomedical research. We are particularly interested in applicants 
working in one of the following areas: Developmental Biology, 
Imaging, Quantitative Biology, Computational Biology, Genomics, 
Systems Biology, and Genetics. Applications from investigators 
working at the interface of these areas will be strongly considered. 
The rank of the appointment will be commensurate with experience 
and the positions come with a substantial start-up package. 


The Department of Genetics comprises an exceptional group of 
31 primary basic science faculty with research interests including 
fundamental aspects of Developmental Biology, Genetics, Genomics 
and Epigenetics, using different model systems including flies, worms, 
fish and mouse, and humans (https://medicine.yale.edu/genetics/). 
The Department is closely associated with science initiatives at Yale 
including The Cancer Center, The Center for Neuroscience, The Stem 
Cell Center and the Yale Center for Genome Analysis. 


Candidate must hold a Ph.D., M.D., or equivalent degree. Applicants 
should upload a cover letter, a curriculum vitae, a description of 
previous research (1 page), a concise statement of research plans (up 
to 2 pages), reprints of 2 publications, and the names of 3 references 
to the Interfolio website at: https://apply.interfolio.com/52614. 
Specific inquiries about the position may be sent to the attention 
of Dr. Antonio Giraldez, Chair of the Department of Genetics, at 
genetics.admin@yale.edu. Applications will begin to be evaluated 
on November 1, 2018. 


Interviews will take place as part of a multidisciplinary symposium 
including candidates for different searches. Please reserve the dates 
of January 15, January 22 and February 11 (snow date) as potential 
dates for the symposium in case you are selected for an interview. 


Yale University is an Affirmative Action/Equal Opportunity 
Employer. Yale values diversity among its students, staff, and 
faculty and strongly welcomes applications from women, 
persons with disabilities, protected veterans, and 
underrepresented minorities. 


ICYS Research Fellow at ICYS, NIMS, Japan 


International Center for Young Scientists (ICYS) of National 
Institute for Materials Science (NIMS) is now seeking a few 
research fellows who have passion to conduct innovative 
materials research based on their original idea. ICYS research 
fellows are expected to pursue high quality research with their 
initiatives using the most advanced facilities in NIMS. 


ICYS offers a “Melting Pot” environment mixing different materials 
research fields and cultures. Common language in ICYS is 
English. Clerical and technical support in English will be given by 
the ICYS staffs. An annual salary of approximately 5.35 million 
yen will be offered, which may depend on the qualification and 
performance of the research fellow. Additional research grant of 
2 million yen per year will be supplied to each research fellow. 
The initial contract term is two years and may be extended by 
another year depending on one’s performance. 


All applicants must have obtained a PhD degree within the last 
ten years. Applicants should submit an application form including 
a research proposal to be conducted during the ICYS term, CV 
Header, CV with a list of publications and patents (be sure to 
attach the header), a list of DOI of journal publications following 
our instruction, PDF files of three significant publications to 
ICYS Recruitment Desk by September 27, 2018 JST. The format 
for application form and CV header can be downloaded from 
our website. Please visit our website for more details. 


ICYS Recruitment Desk, 
National Institute for Materials Science 
http:/www.nims.go.jp/icys/recruitment/index.html 


OPEN FACULTY POSITION 
SA INSTITUTE OF MOLECULAR BIOLOGY 
©. ACADEMIA SINICA, TAIWAN, ROC 


One tenure-track faculty position is open for a highly qualified individual 
to establish independent research programs in all disciplines of 
molecular and cellular biology. Applicants should hold a Ph.D. degree or 
its equivalent, with appropriate postdoctoral research experience. The 
successful recruit will be appointed at the levels of Assistant, Associate, 
or Full Research Fellows (equivalent to academic ranks of Assistant, 
Associate and Full Professors at universities), and receive a generous 
multi-year start-up package, followed by annual intramural support. 


The Institute of Molecular Biology at Academia Sinica (http://www. 
imb.sinica.edu.tw/en) provides an active and stimulating research 
environment, is well supported by both extramural and long-term 
intramural funding, and features several core facilities (cell imaging, 
microarray, Next Generation Sequencing, RNAi, electrophysiology, 
FACS, bioinformatics and mouse facilities) that provide state-of-the- 
art resources and key technical expertise to the Institute’s research 
community. Currently two Ph.D. programs, with one recruiting 
international students, are formally affiliated with the Institute. English 
is the official language for seminars and lectures at the Institute, and 
proficiency in Chinese language is not a prerequisite for application. 


Applicants should send electronic files of their Curriculum Vitae, a 
description of past research accomplishments and future research plans, 
and arrange for three letters of recommendation to be sent directly to 
Director Soo-Chen Cheng c/o Ms. Vivi Chiang (vivi@imb.sinica.edu.tw). 


The selection process will start on November 15, 2018 until the positions 
are filled. Further information can be obtained from Ms. Vivi Chiang at 
vivi@imb.sinica.edu.tw 


Northwestern University Feinberg School of Medicine Successful candidates will possess a MD, PhD, or MD/PhD 
invites applications and nominations for the inaugural degree and be eligible for a faculty appointment as a full- 
Director, Center for Immunobiology. The new Director will time Professor with scholarly accomplishments and interest 
report directly to the Dean of the medical school. in recruiting and developing a strong scientific team. 

In this role, the Director is responsible for oversight of the Candidate should be recognized investigators in human 
research and administrative affairs of the Center at the immunology with complementary strengths in basic and 
Feinberg School of Medicine. translational investigation. 

The Center for Immunobiology is an interdepartmental Email nominations and CVs of appropriate candidates to 
center with faculty who are engaged in a broad lla Allen, recruitment coordinator, at: 


spectrum of cutting-edge basic, translational, and clinical 
immunology-related research. Principal investigators 
appointed through the Feinberg School of Medicine are Apply online at: 

supported by $471 million of annual research funding. https://facultyrecruiting.northwestern.edu/apply/MTQz 
The medical school is embedded among three U.S. News & 
World Report Honor Roll hospitals; Northwestern Memorial 
Hospital is ranked 13th, Lurie Children’s Hospital 7th, and 
the Rehabilitation Institute of Chicago 1st. 


immunobiosearch@northwestern.edu 


Northwestern University is a 


n Equal eee IUIEN US eae 


with disabiliti 
contingen 


MW Northwestern 
Medicine 


U pdufich is one of the leading roccaren univer ciie in’ Europe. With ; a 500-year history 
j ‘of scholarly excellence, LMU boasts state- -of-the-art facilities, innovative research and 
N Lupwic- : transdisciplinary co- “operation across all ‘fields, from the sciences to the humanities. LMU is 
re @ based in Munich -.a d is an important part of G rmany’ s vibrant and ames science ands << 
MUNCHEN : ee hub: ; 


To provide outstanding junior researchers in all subject areas with a 


long-term career perspective, LMU is offering TEN U RE TRACK 


PROFESSORSHIPS TO SUCCESSFUL ERC 
STARTING GRANTEES of the European Research Council 


(for candidates 2-7 years after their PhD). 


If you are interested in joining LMU’s research community, please 
contact the appropriate faculty member in your field of research as soon 
as possible. For further information regarding this offer, please see 


www.|mu.de/excellent/erc-tenuretrack 


WORKING LIFE 


By Jeffrey Maloy 
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The universe in a classroom 


he universe is an unfathomably large place. But from the front of a lecture hall, it can feel 

suffocatingly tiny. Standing behind a podium at 8 a.m., I looked out into an auditorium at 

360 students; 720 eyes staring intently back at me; 3600 fingers furiously pounding away at 

keyboards, transcribing my uncertainty for posterity. Did I say “polymerase” when I meant to 

say “primase”? Was my answer to that question clear? The heat radiating from the projector felt 

stifling. With 5 minutes remaining, I fumbled an explanation. My chest tightened as I scanned 
the lecture hall, searching for an escape but finding only four walls that seemed to be inching closer. 
An eternity elapsed in 30 seconds. When I heard a chuckle (or was it a snicker?) from the corner 
of the room, I yielded to my inadequacy and dismissed class with a precious 2 minutes to spare, 
promising to rehash the topic at the beginning of the next class. 


Later that day, I recounted the 
gaffe to a colleague. “We’ve all been 
there,” she said sympathetically. 
Rationally, I knew she was being 
earnest. Yet I couldn’t bring myself 
to truly believe it. What if my mis- 
takes reflect a real deficiency as a 


teacher, a scientist, a scholar? Un- 
Ke) 


like her, I don’t have years of ex- | ° oo 


perience to validate my ability. The 
Ph.D. diploma that sits on top of 
my bookshelf is less than a year 
old. It was only my second time 
teaching my own large lecture 
course to undergraduates. 

Ever since my first experience 
as a teaching assistant in graduate 
school, I’ve looked forward to the 
day I would be the person stand- 
ing behind that podium. Now that 
Tm there, I can’t shake the feeling 
that some cosmic accident deliv- 
ered me. What qualifies me to stand in front of a classroom 
and explain the mysteries of biology to hundreds of eager 
students? Three recently earned letters behind my name? A 
carefully ironed blue Oxford? 

In my classes, I preach the gospel of facts and data. But 
intellectual knowledge and emotional conviction don’t al- 
ways match up. I’ve spent immeasurable hours convincing 
my students and peers that they are qualified, that they 
are good enough, that they are extraordinary. I worry that 
I may be the exception. Am I a qualified biologist? Am I a 
good enough mentor? Am I an extraordinary teacher? My 
degrees and teaching awards should theoretically provide 
some evidence of my abilities, but emotional conviction 
remains elusive. 

A few days later, as I lay in bed dwelling on my resume of 
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“What qualifies me to stand 
in front of a classroom [of] 
hundreds of eager students?” 


perceived failures and shortcom- 
ings, my phone buzzed: an email 
from a former student. “Dear Dr. 
Maloy,” it read. “Your class was 
my very first college lecture. You 
made an amazing impression 
with your kindness, approach- 
ability, and encouragement. I dis- 
covered that I love learning about 
the molecular processes of life so 
much that I really want to con- 
tinue learning more.” I smiled and 
dragged the email to my remind- 
ers folder, for reference in future 
moments of doubt. 

Maybe I am faking it. Maybe stu- 
dents can see through my facade of 
confidence in class every day as I 
navigate a landscape littered with 
landmines of uncertainty. And in 
the expanse of the universe, my 
classroom is just an infinitesimal 
stage. But I cannot brush off the small dent my existence 
has made in the experience of one student. One student 
who sat down at her computer on a Friday evening to say 
that I made a difference. 

The next morning at 8 o’clock, 360 students trickled 
into the classroom, the groan of folding plastic chairs 
announcing their arrival. I exhaled and allowed the clamor 
of the lecture hall to wash over me and drown out my 
doubts. I plugged my laptop into the projector and walked 
to the back of the classroom to greet the students. 
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Jeffrey Maloy is a lecturer and education research fellow 
at the University of California, Los Angeles. Do you have 
an interesting career story that you would like to share? 
Send it to SciCareerEditor @aaas.org. 
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